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Use of Nuclear Magnetic Resonance to Identify 
Ligands to Target Biomolecules 

Technical Field of the Invention 
5 The present invention pertains to a method for the screening of compounds 

for biological activity and to the determination of binding dissociation constants 
using two-dimensional *~W*H NMR correlation spectral analysis to identify and 
design ligands that bind to a target biomolecule. 

10 Background of the Invention 

One of the most powerful tools for discovering new drug leads is random 

screening of synthetic chemical and natural product databases to discover 

compounds that bind to a particular target molecule (i.e., the identification of 

ligands of that target). Using this method, ligands may be identified by their ability 
15 to form a physical association with a target molecule or by their ability to alter a 

function of a target molecule. 

When physical binding is sought, a target molecule is typically exposed to 

one or more compounds suspected of being ligands and assays are performed to 

determine if complexes between the target molecule and one or more of those 
20 compounds are formed. Such assays, as is well known in the art, test for gross 

changes in the target molecule (e.g., changes in size, charge, mobility) that indicate 

complex formation. 

Where functional changes are measured, assay conditions are established 

that allow for measurement of a biological or chemical event related to the target 
25 molecule (e.g., enzyme catalyzed reaction, receptor-mediated enzyme activation). 

To identify an alteration, the function of the target molecule is determined before 

and after exposure to the test compounds. 

Existing physical and functional assays have been used successfully to 

identify new drug leads for use in designing therapeutic compounds. There are, 
30 however, limitations inherent to those assays that compromise their accuracy, 

reliability and efficiency. 

A major shortcoming of existing assays relates to the problem of "false 

positives". In a typical functional assay, a "false positive 1 * is a compound that 

triggers the assay but which compound is not effective in eliciting the desired 
35 physiological response. In a typical physical assay, a "false positive" is a 

compound that, for example, attaches itself to the target but in a non-specific 

manner (e.g M non-specific binding). False positives are particularly prevalent and 
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problematic when screening higher concentrations of putative ligands because 
many compounds have non-specific affects at those concentrations. 

In a similar fashion, existing assays are plagued by the problem of "false 
negatives", which result when a compound gives a negative response in the assay 

5 but which compound is actually a ligand for the target. False negatives typically 
occur in assays that use concentrations of test compounds that are either too high 
(resulting in toxicity) or too low relative to the binding or dissociation constant of 
the compound to the target 

Another major shortcoming of existing assays is the limited amount of 

10 information provided by the assay itself. While the assay may correctly identify 
compounds that attach to or elicit a response from the target molecule, those assays 
typically do not provide any information about either specific binding sites on the 
target molecule or structure activity relationships between the compound being 
tested and the target molecule. The inability to provide any such information is 

15 particularly problematic where the screening assay is being used to identify leads 
for further study. 

It has recently been suggested that X-ray crystallography can be used to 
identify the binding sites of organic solvents on macromolccules. However, this 
method cannot determine the relative binding affinities at different sites on the 
20 target. It is only applicable to very stable target proteins that do not denature in the 
presence of high concentrations of organic solvents. Moreover, this approach is 
not a screening method for rapidly testing many compounds that are chemically 
diverse, but is limited to mapping the binding sites of only a few organic solvents 
due to the long time needed to determine the individual crystal structures. 

25 Compounds are screened to identify leads that can be used in the design of 

new drugs that alter the function of the target biomolecule. Those new drugs can 
be structural analogs of identified leads or can be conjugates of one or more such 
lead compounds. Because of the problems inherent to existing screening methods, 
those methods are often of little help in designing new drugs. 

30 There continues to be a need to provide new, rapid, efficient, accurate and 

reliable means of screening compounds to identify and design ligands that 
specifically bind to a particular target 
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Brief Summary of the Invention 
In one aspect, the present invention provides a process of screening 
compounds for biological activity to identify ligands that bind to a specific target 
molecule. That process comprises the steps of: a) generating a first two- 
5 dimensional 15 N/*H NMR correlation spectrum of a 15 N-labeled target molecule; 

b) exposing the labeled target molecule to one or a mixture of chemical compounds; 

c) generating a second two-dimensional 15 N/*H NMR correlation spectrum of the 

labeled target molecule that has been exposed to one or a mixture of compounds in 

15 1 

step (b); and d) comparing said first and second two-dimensional N/ H NMR 

10 correlation spectra to determine differences between said first and said second 

spectra, the differences identifying the presence of one or more compounds that are 
ligands which have bound to the target molecule. 

Where the process of the present invention screens more than one 
compound in step (b), that is, a mixture of compounds, and where a difference 

15 between the first spectrum generated from the target molecule alone and that 

generated from the target molecule in the presence of the mixture, additional steps 
are performed to identify which specific compound or compounds contained in the 
mixture is binding to the target molecule. Those additional steps comprise the 
steps of e) exposing the l5 N-labeIed target molecule individually to each compound 

20 of the mixture, 0 generating a two-dimensional ^N/^H NMR correlation 

spectrum of the labeled target molecule that has been individually exposed to each 
compound; and g) comparing each spectrum generated in step f) to the first 
spectrum generated from the target molecule alone to determine differences in any 
of those compared spectra, the differences identifying the presence of a compound 

25 that is a ligand which has bound to the target molecule. 

15 1 

Because the chemical shift values of the particular N/ H signals in the 
two-dimensional correlation spectrum correspond to known specific locations of 
atomic groupings in the target molecule (e.g., the N-H atoms of the amide or 
peptide link of a particular amino acid residue in a polypeptide), the process of the 

30 present invention allows not only for the for identification of which compound(s) 
bind to a particular target molecule, but also permit the determination of the 
particular binding site of the ligand on the target molecule. 

In a second aspect, the present invention provides a process of determining 
the dissociation constant, Kd> for a given ligand and its target molecule. That 

35 process comprises the steps of a) generating a first two-dimensional ^N/^H NMR 
correlation spectrum of a l5 N-labeled target molecule; b) exposing the labeled 
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target molecule to various concentrations of a ligand; c) generating a two- 
dimensional *~W*H NMR correlation spectrum at each concentration of ligand in 
step (b); d) comparing each spectrum from step (c) to the first spectrum from step 
(a); and e) calculating the dissociation constant between the target molecule and the 
5 ligand from those differences according to the equation: 

K D = ([P] 0 -x)([L] 0 -x) 
x 

An advantageous aspect of the present invention is the capability of the 
process of the present invention to determine the dissociation constant of one 
10 ligand of the target molecule in the presence of a second molecule already bound to 
the ligand. This is generally not possible with prior art methods which employ 
"wet chemical" analytical methods of determining binding of a ligand to a target 
molecule substrate. 

In this preferred embodiment, the process of determining the dissociation 
15 constant of a ligand can be performed in the presence of a second bound ligand. In 
accordance with this embodiment, the ^N-labeled target molecule is bound to that 
second ligand before exposing that target to the test compounds. 

The ability of the present method to determine not only the existence of 
binding between one ligand and the target molecule, but also the particular site of 
20 binding in the presence of a second bound ligand permits the capability to design a 
drug that comprises two or more linked moieties made up of the ligands. 

This method uses the two-dimensional *^N/*H NMR correlation 
spectroscopic screening process as set forth above to identify a first and 
subsequent ligands that bind to the target molecule. A complex of the target 
25 molecule and two or more ligands is formed and the three-dimensional structure of 
that complex is determined preferably using NMR spectroscopy or X-ray 
crystallography. That three-dimensional structure is used to determine the spatial 
orientation of the ligands relative to each other and to the target molecule. 

Based on the spatial orientation, the ligands are linked together to form the 
30 drug. The selection of an appropriate linking group is made by maintaining the 
spatial orientation of the ligands to one anotherand to the target molecule based 
upon principles of bond angle and bond length information well known in the 
organic chemical art. 
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Thus, the molecular design method comprises identifying a first ligand 
moiety to the target molecule using two-dimensional *"W*H NMR correlation 
spectroscopy; identifying subsequent ligand moieties to the target molecule using 
two-dimensional *~W*H NMR correlation spectroscopy; forming a complex of 
the first and subsequent ligand moieties to the target molecule; determining the 
three dimensional structure of the complex and, thus, the spatial orientation of the 
first and subsequent ligand moieties on the target molecule; and linking the first and 
subsequent ligand moieties to form the drug to maintain the spatial orientation of 
the ligand moieties. 

The identification of subsequent ligand moieities can be performed in the 
absence or presence of the first ligand (e.g., the target molecule can be bound to 
the first ligand before being exposed to the test compounds for identification of the 
second ligand). 

In a preferred embodiment, the target molecule used in a screening or 
design process is a polypeptide. The polypeptide target is preferably produced in 
recombinant form from a host cell transformed with an expression vector that 
contains a polynucleotide that encodes the polypeptide, by culturing the 
transformed host cell in a medium that contains an assimilable source of l5 N such 
that the recombinantly produced polypeptide is labeled with . 

Brief Description of the Drawings 
In the drawings which form a portion of the specification: 
FIG. 1 shows a 15 N/*H correlation spectmm of the DNA binding domain of 
uniformly ^N-labeled human papillomavirus E2. The spectrum (80 
complex points, 4 scans/fid) was acquired on a 0.5 mM sample of E2 in 20 
mM phosphate (pH 6.5), 10 mM dithiothreitol (DTT) and 10% deuterium 
oxide (D2O). 

FIG. 2 shows 15 N/*H correlation spectra of the DNA binding domain of 

uniformly ^N-labeled human papillomavirus E2 before (thin multiple 
contours) and after (thick single contours) addition of a final test 
compound. The final concentration of compound was LO mM. All other 
conditions are as stated in FIG. 1. Selected residues that show significant 
changes upon binding are indicated. 
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FIG. 3 shows ^N/ l H correlation spectra of the DNA binding domain of 

uniformly ^N-labeled human papillomavirus E2 before (thin multiple 
contours) and after (thick single contours) addition of a second test 
compound. The final concentration of compound was 1.0 mM. AH other 
5 conditions are as stated in FIG. 1. Selected residues that show significant 

changes upon binding are indicated. 
FIG. 4 shows *^N/*H correlation spectra of the catalytic domain of uniformly 
^N-labeled stromelysin before (thin multiple contours) and after (thick 
single contours) addition of a test compound. The final concentration of 
10 m compound was 1.0 mM. The spectra (80 complex points, 8 scans/fid) 

were acquired on a 03 mM sample of SCD in 20 mM TRIS (pH 7.0), 20 
mMCaCl2and 10%D2O. Selected residues that show significant changes 
upon binding are indicated. 
FIG. 5 shows *^N/*H correlation spectra of the Ras-binding domain of uniformly 
15 ^N-labeled RAF peptide (residues 55-132) before (thin multiple contours) 

and after (thick single contours) addition of a test compound. The final 
concentration of compound was 1.0 mM. The spectra (80 complex points, 
8 scans/fid) were acquired on a 0.3 mM sample of the RAF fragment in 20 
mM phosphate (pH 7.0), 10 mM DTT and 10% D2O. Selected residues 
20 that show significant changes upon binding are indicated. 

FIG. 6 shows *^N/*H correlation spectra of uniformly ^N-labelcd FKBP before 
(thin multiple contours) and after (thick single contours) addition of a test 
compound. The final concentration of compound was 1.0 mM. The 
spectra (80 complex points, 4 scans/fid) was acquired on a 0.3 mM sample 
25 of FKBP in 50 mM phosphate (pH 6.5), 100 mM NaCl and 10% D2O. 

Selected residues that show significant changes upon binding are indicated. 
FIG. 7 shows a first depiction of the NMR-derived structure of the DNA-bmding 
domain of E2. The two monomers of the symmetric dimer are oriented in a 
top-bottom fashion, and the N- and C-termini of each monomer are 
30 indicated (N and C for one monomer, N* and C* for the other). Shown in 

ribbons are the residues which exhibit significant chemical shift chances 

1 15 • - 

(A6( H)>0.04 ppm; A6( N) >0.1 ppm) upon binding to a first test 

compound. These residues correspond to the DNA-recognition helix of 
E2. Selected residues are numbered for aid in visualization. 
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FIG. 8 shows a second depiction of the NMR-derived structure of the DNA- 
binding domain of E2, The two monomers of the symmetric dimer are 
oriented in a top-bottom fashion, and the N- and C-termini of each 
monomer are indicated (N and C for one monomer, N* and C* for the 
5 other). Shown in ribbons are the residues which exhibit significant 

chemical shift changes (A6(*H)>0.04 ppm; A6(^N) >0.1 ppm) upon 
binding to a second test compound. These residues are located primarily in 
the dimer interface region. Selected residues are numbered for aid in 
visualization. 

10 FIG. 9 shows a depiction of the NMR-derived structure of the catalytic domain of 
stromelysin. The N* and C-termini are indicated. Shown in ribbons are 
the residues which exhibit significant chemical shift changes (A6(*H)>0.04 
ppm; A6(*~*N) >0. 1 ppm) upon binding to a test compound. These either 
form part of the SI * binding site or are spatially proximal to this site. 

1 5 Selected residues are numbered for aid in visualization. 

FIG. 10 shows a ribbon plot of a ternary complex of first and second ligands 
bound to the catalytic domain of stromelysin. 

Detailed Description of the Invention 
20 The present invention provides a rapid and efficient screening method for 

identifying ligands that bind to therapeutic target molecules. 

Ligands are identified by testing the binding of molecules to a target 
molecule (e.g., protein, nucleic acid, etc.) by following, with nuclear magnetic 
resonance (NMR) spectroscopy, the changes in chemical shifts of the target 
25 molecule upon the addition of the ligand compounds in the database. 

From an analysis of the chemical shift changes of the target molecule as a 
function of ligand concentration, the binding affinities of ligands for biomolecules 
are also determined. 

The location of the binding site for each ligand is determined from an 
30 analysis of the chemical shifts of the biomolecule that change upon the addition of 
the ligand and from nuclear Overhauser effects (NOEs) between the ligand and 
biomolecule. 

Information about the structure/activity relationships between ligands 
identified by such a process can then be used to design new drugs that serve as 
35 ligands to the target molecule. By way of example, where two or more ligands to a 
given target molecule are identified, a complex of those ligands and the target 
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molecule is formed. The spatial orientation of the ligands to each other as well as 
to the target molecule is derived from the three-dimensional structure. That spatial 
orientation defines the distance between the binding sites of the two ligands and the 
orientation of each ligand to those sites. 
5 Using that spatial orientation data, the two or more ligands are then linked 

together to form a new ligand. Linking is accomplished in a manner that maintains 
the spatial orientation of the ligands to one another and to the target molecule. 

There are numerous advantages to the NMR-based discovery process of the 
present invention. First, because a process of the present invention identifies 
10 ligands by directly measuring binding to the target molecule, the problem of false 
positives is significantly reduced. Because the present process identifies specific 
binding sites to the target molecule, the problem of false positives resulting from 
the non-specific binding of compounds to the target molecule at high 
concentrations is eliminated. 
15 Second, the problem of false negatives is significantly reduced because the 

present process can identify compounds that specifically bind to the target molecule 
with a wide range of dissociation constants. The dissociation or binding constant 
for compounds can actually be determined with the present process. 

Other advantages of the present invention result from the variety and 
20 detailed data provided about each ligand from the discovery process. 

Because the location of the bound ligand can be determined from an 
analysis of the chemical shifts of the target molecule that change upon the addition 
of the ligand and from nuclear Overhauser effects (NOEs) between the ligand and 
biomoleeule, the binding of a second ligand can be measured in the presence of a 
25 first ligand that is already bound to the target. The ability to simultaneously 
identify binding sites of different ligands allows a skilled artisan to 1) define 
negative and positive cooperative binding between ligands and 2) design new 
drugs by linking two or more ligands into a single compound while maintaining a 
proper orientation of the ligands to one another and to their binding sites. 
30 Further, if multiple binding sites exist, the relative affinity of individual 

binding moieties for the different binding sites can be measured from an analysis of 
the chemical shift changes of the target molecule as a function of the added 
concentration of the ligand. By simultaneously screening numerous structural 
analogs of a given compound, detailed structure/activity relationships about ligands 
35 is provided. 
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In its principal aspect, the present invention provides a process of screening 
compounds to identify ligands that bind to a specific target molecule. That process 
comprises the steps of: a) generating a first two-dimensional 15 N/ ! H NMR 
correlation spectrum of a 15 N-Iabeled target molecule; b) exposing the labeled 
target molecule to one or more compounds; c) generating a second two- 
dimensional N/ H NMR correlataon spectrum of the labeled target molecule that 
has been exposed to the compounds of step (b); and d) comparing the first and 
second spectra to determine whether differences in those two spectra exist which 
differences indicate the presence of one or more ligands that have bound to the 
target molecule. 

Where a process of the present invention screens more than one compound 
m step (b) and where a difference between spectra is observed, additional steps are 
performed to identify which specific compound is binding to the target molecules 
Those additional steps comprise generating a two-dimensional 15 N/ ! H NMR 
correlation spectrum for each individual compound and comparing each spectrum 
to the first spectrum to determine whether differences in any of those compared 
spectra exist, which differences indicate the presence of a ligand that has bound to 
the target molecule. 

Any 15 N-Iabeled target molecule can be used in a process of the present 
invention. Because of the importance of proteins in medicinal chemistry a 
preferred target molecule is a polypeptide. The target molecule can be labeled with 
N using any means well known i„ the art. In a preferred embodiment, the target 
molecule is prepared m recombinant form using transformed host cells. In an 
especially preferred embodiment, the target molecule is a polypeptide. Any 
polypeptide that gives a high resolution NMR spectrum and can be partially or 
uniformly labeled with 15 N can be used. The preparation of uniformly 15 N- 
labeled exemplary polypeptide target molecules is set forth hereinafter in the 
Examples. 

A preferred means of preparing adequate quantities of uniformly I5 N- 
labeled polypeptides is to transform a host cell with an expression vector that 
contains a polynucleotide that encodes that polypeptide and culture the transformed 
cell ,n a culture medium that contains assimilable sources of l5 N. Assimilable 
sources of N are well known in the art A preferred such source is 15 NH4CI. 

Means for preparing expression vectors that contain polynucleotides 
encoding specific polypeptides are well known in the art. In a similar manner 
means for transforming host cells with those vectors and means for culturing those 
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transformed cells so that the polypeptide is expressed are also well known in the 
art. 

The screening process of the present invention begins with the generation 
or acquisition of a two-dimensional *^N/*H correlation spectrum of the labeled 
5 target molecule. Means for generating two-dimensional ^N/*H correlation 
spectra are well known in the art (see, e.g., D. A. Egan et aL, Biochemistry . 
32(8): 1920-1927 (1993); Bax, A., Grzesiek, S„ Acc, Chem. Res. . 26(4): 131- 
138 (1993)). 

The NMR spectra that are typically recorded in the screening procedure of 
1 o the present invention are two-dimensional ' ^N/ * H heteronuclear single quantum 
correlation (HSQG) spectra. Because the *^N/*H signals corresponding to the 
backbone amides of the proteins are usually well-resolved, the chemical shift 
changes for the individual amides are readily monitored. 

In generating such spectra, the large water signal is suppressed by spoiling 
15 gradients. To facilitate the acquisition of NMR data on a large number of 

compounds (e.g., a database of synthetic or naturally occurring small organic 
compounds), a sample changer is employed. Using the sample changer, a total of 
60 samples can be run unattended. Thus, using the typical acquisition parameters 
(4 scans per free induction decay (fid), 100-120 HSQC spectra can be acquired in a 
20 24 hour period. 

To facilitate processing of the NMR data, computer programs are used to 
transfer and automatically process the multiple two-dimensional NMR data sets, 
including a routine to automatically phase the two-dimensional NMR data. The 
analysis of the data can be facilitated by formatting the data so that the individual 
25 HSQC spectra are rapidly viewed and compared to the HSQC spectrum of the 
control sample containing only the vehicle for the added compound (DMSO), but 
no added compound. Detailed descriptions of means of generating such two- 
dimensional *~*N/*H correlation spectra are set forth hereinafter in the Examples. 
A representative two-dimensional *^N/*H NMR correlation spectrum of an 
30 ^N-labeled target molecule (polypeptide) is shown in FIG. 1 (the DNA-binding 
domain of the E2 protein). 

Following acquisition of the first spectrum, the labeled target molecule is 
exposed to one or more test compounds. Where more than one test compound is 
to be tested simultaneously, it is preferred to use a database of compounds such as 
35 a plurality of small molecules. Such molecules are typically dissolved in 
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perdeuterated dimethylsulfoxidc . The compounds in the database can be 
purchased from vendors or created according to desired needs. 

Individual compounds can be selected inter alia on the basis of size 
(molecular weight = 100-300) and molecular diversity. Compounds in the 

5 collection can have different shapes (e.g., flat aromatic rings(s), puckered aliphatic 
rings(s), straight and branched chain aliphatics with single, double, or triple 
bonds) and diverse functional groups (e.g., carboxylic acids, esters, ethers, 
amines, aldehydes, ketones, and various heterocyclic rings) for maximizing the 
possibility of discovering compounds that interact with widely diverse binding 

10 sites. 

The NMR screening process of the present invention utilizes ligand 
concentrations ranging from about 0.1 to about 10.0 mM At these concentrations, 
compounds which are acidic or basic can significantly change the pH of buffered 
protein solutions. Chemical shifts are sensitive to pH changes as well as direct 

1 5 binding interactions, and "false positive" chemical shift changes, which are not the 
result of ligand binding but of changes in pH, can therefore be observed. It is thus 
necessary to ensure that the pH of the buffered solution does not change upon 
addition of the ligand. One means of controlling pH is set forth below. 

Compounds are stored at 263°K as 1.0 and 0.1 M stock solutions in 

20 dimethylsulfoxide (DMSO). This is necessary because of the limited solubility of 
the ligands in aqueous solution. It is not possible to directly adjust the pH of the 
DMSO solution. In addition, HC1 and NaOH form insoluble salts in DMSO, so 
alternative acids and bases must be used. The following approach has been found 
to result in stable pHL 

25 The 1 .0 M stock solutions in DMSO are diluted 1 : 10 in 50 mM phosphate, 

pH 7.0. The pH of that diluted aliquot solution is measured. If the pH of the 
aliquot is unchanged (i.e.* remains at 7.0), a working solution is made by diluting 
the DMSO stock solution 1 : 10 to make a 0. 1 M solution and that solution is stored. 
If the pH of the diluted aliquot is less than 7.0, ethanolamine is added to the 

30 1.0 M stock DMSO solution, that stock solution is then diluted 1: 10 with 

phosphate buffer to make another aliquot, and the pH of the aliquot rechecked. 

If the pH of the diluted aliquot is greater than 7.0, acetic acid is added to the 
1.0 M stock DMSO solution, that stock solution is then diluted 1; 10 with 
phosphate buffer to make another aliquot, and the pH of the aliquot rechecked. 
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Ethanolamine and acetic acid are soluble in DMSO, and the proper 
equivalents are added to ensure that upon transfer to aqueous buffer, the pH is 
unchanged. Adjusting the pH is an interactive process, repeated until the desired 
result is obtained. 

5 Note that this procedure is performed on 1: 10 dilutions of 1.0 M stock 

solutions (100 mM ligand) to ensure that no pH changes are observed at the lower 
concentrations used in the experiments (0.1 to 10 mM) or in different/weaker 
buffer systems. 

Following exposure of the ^N-labeled target molecule to one or more test 
to compounds, a second two-dimensional *^N/'h NMR correlation spectrum is 
generated. That second spectrum is generated in the" same manner as set forth 
above. The first and second spectra are then compared to determine whether there 
are any differences between the two spectra. Differences in the two-dimensional 
*^N/*H NMR correlation spectra that indicate the presence of a ligand correspond 
15 to ^N-labeled sites in the target molecule. Those differences are determined using 
standard procedures well known in the art. 

By way of example t FIGs. 2, 3, 4, 5 and 6 show comparisons of 
correlation spectra before and after exposure of various target molecules to various 
test compounds. A detailed description of how these studies were performed can 
20 be found hereinafter in Examples 2 and 3. 

Particular signals in a two-dimensional *^N/*H correlation spectrum 
correspond to specific nitrogen and proton atoms in the target molecule (e.g., 
particular amides of the amino acid residues in the protein). By way of example, it 
can be seen from FIG. 2 that chemical shifts in a two-dimensional *^N/*H 
25 correlation of the DNA -binding domain of E2 exposed to a test compound occurred 
at residue positions 15 (115), 21 (Y21), 22 (R22) and 23 (L23). 

It can be seen from FIG. 2 that the binding of the ligand involved the 
isoleucine (He) residue at position 15 t the tyrosine (Tyr) residue at position 21, the 
arginine (Arg) residue at position 22 and the leucine (Leu) residue at position 23. 
30 Thus, a process of the present invention can also be used to identify the specific 
binding site between a ligand and target molecule. 

The region of the protein that is responsible for binding to the individual 
compounds is identified from the particular amide signals that change upon the 
addition of the compounds. These signals are assigned to the individual amide 
35 groups of the protein by standard procedures using a variety of well-established 
heteronuclear multi-dimensional NMR experiments. 
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To discover molecules that bind more tightly to the protein, molecules arc 
selected for testing based on the structure/activity relationships from the initial 
screen and/or structural information on the initial leads when bound to the protein. 
By way of example, the initial screening may result in the identification of ligands, 

5 all of which contain an aromatic ring. The second round of screening would then 
use other aromatic molecules as the test compounds. 

As set forth hereinafter in Example 2, an initial screening assay for binding 
to the catalytic domain of strbmelysin identified two biaryl compounds as ligands. 
The second round of screening thus used a series of biaryl derivatives as the test 

10 compounds. 

The second set of test compounds are initially screened at a concentration of 
1 mM, and binding constants are measured for those that show affinity. Best leads 
that bind to the protein are then compared to the results obtained in a functional 
assay. Those compounds that are suitable leads are chemically modified to 

15 produce analogs with the goal of discovering a new pharmaceutical agent. 

In another aspect, the present invention provides a process for determining 
the dissociation constant between a target molecule and a ligand that binds to that 
target molecule. That process comprises the steps of: a) generating a first two- 
dimensional *"W*HNMR correlation spectrum of a ^N-labelcd target molecule; 

20 b) titrating the labeled target molecule with various concentrations of a ligand; c) 
generating a two-dimensional *"W*H NMR correlation spectrum at each 
concentration of ligand from step (b); d) comparing each spectrum from step (c) 
both to the first spectrum from step (a) and to all other spectra from step (c) to 
quantify differences in those spectra as a function of changes in ligand 

25 concentration; and e) calculating the dissociation constant (Kd) between the target 
molecule and the ligand from those differences. 

Because of their importance in medicinal chemistry, a preferred target 
molecule for use in such a process is a polypeptide. In one preferred embodiment, 
a process of determining the dissociation constant of a ligand can be performed in 

30 the presence of a second ligand. In accordance with this embodiment, the 

labeled target molecule is bound to that second ligand before exposing that target to 
the test compounds. 

Binding or dissociation constants are measured by following the *^N/*H 
chemical shifts of the protein as a function of ligand concentration. A known 

35 concentration ([P]o) of the target moleule is mixed with a known concentration 
([L]q) of a previously identified ligand and the two-dimensional 15 N/ *H 
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correlation spectrum was acquired. From this spectrum, observed chemical shift 
values (6 0 bs) are obtained. The process is repeated for varying concentrations of 
the ligand to the point of saturation of the target molecule, when possible, in which 
case the limiting chemical shift value for saturation (6 sat ) is measured. 

5 In those situations where saturation of the target molecule is achieved, the 

dissociation constant for the binding of a particular ligand to the targer molecule is 
calculated using the formula: 

K D 

X 

where [P]o is the total molar concentration of target molecule; [L]o is the total molar 
10 concentration of ligand; and x is the molar concentration of the bound species. The 
value of x is determined from the equation: 



6 obs * 6 free 



X = 



15 where 6^ is the chemical shift of the free species; 6 0 b s is the observed chemical 
shift; and A is the difference between the limiting chemical shift value for saturation 
(6sai) toe chemical shift value of the target molecule free of ligand . 

The dissociation constant is then determined by varying its value until a 
best fit to the observed data is obtained using standard curve-fitting statistical 

20 methods. In the case where 6^ is not directly known, both Kd and 6 sa t are varied 
and subjected to the same curve-fitting procedure. 

The use of the process of the present invention to determine the dissociation 
or binding affinity of various ligands to various target molecules is set forth 
hereinafter in Examples 2 and 3. 

25 Preferred target molecules, means for generating spectra, and means for 

comparing spectra are the same as set forth above. 

The initial step in the design process is the identification of two or more 
ligands that bind to the specific target molecule. The identification of such ligands 
is done using two-dimensional *~*N/*H NMR correlation spectroscopy as set forth 

30 above. 

Once two or more ligands are identified as binding to the target molecule at 
different sites, a complex between the target molecule and ligands is formed. 
Where there are two ligands, that complex is a ternary complex. Quaternary and 
other complexes are formed where there arc three or more ligands. 



WO 97/1 847 J PCT/US96/18270 

15 

Complexes are formed by mixing the target molecule simultaneously or 
sequentially with the various ligands under circumstances that allow those ligands 
to bind the target. Means for determining those conditions are well known in the 
art. 

5 Once that complex is formed, its three-dimensional structure is determined. 

Any means of determining three-dimensional structure can be used. Such methods 
are well known in the art Exemplary and preferred methods are NMR and X-ray 
crystallography. The use of three-dimensional double- and triple resonance NMR 
to determine the three-dimensional structure of two ligands bound to the catalytic 
10 domain of stromelysin is set forth in detail hereinafter in Example 4. 

An analysis of the three-dimensional structure reveals the spatial orientation 
of the ligands relative to each other as well as to the conformation of the target 
molecule. First, tjie spatial orientation of each ligand to the target molecule allows 
for identification of those portions of the ligand directly involved in binding (i.e., 
15 those portions interacting with the target binding site) and those portions of each 
ligand that project away from the binding site and which portions can be used in 
subsequent linking procedures. 

Second, the spatial orientation data is used to map the positions of each 
ligand relative to each other. In other words, discrete distances between the 
20 spatially oriented ligands can be calculated. 

Third, the spatial orientation data also defines the three-dimensional 
relationships amongst the ligands and the target. Thus, in addition to calculating 
the absolute distances between ligands, the angular orientations of those ligands 
can also be determined. 
25 Knowledge of the spatial orientations of the ligands and target is then used 

to select linkers to link two or more ligands together into a single entity that 
contains all of the ligands. The design of the linkers is based on the distances and 
angular orientation needed to maintain each of the ligand portions of the single 
entity in proper orientation to the target 
30 The three-dimensional conformation of suitable linkers is well known or 

readily ascertainable by one of ordinary skill in the art While it is theoretically 
possible to link two or more ligands together over any range of distance and three- 
dimensional projection, in practice certain limitations of distance and projection are 
preferred. In a preferred embodiment, ligands are separated by a distance of less 
35 than about 15 Angstroms (A), more preferably less than about 10 A and, even 
more preferably less than about 5 A, 
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Once a suitable linker group is identified, the ligands are linked with that 
linker. Means for linking ligands are well known in the art and depend upon the 
chemical structure of the ligand and the linking group itself. Ligands are linked to 
one another using those portions of the ligand not directly involved in binding to 
5 the target molecule. 

A detailed description of the design of a drug that inhibits the proteolytic 
activity of stromelysin, which drug was designed using a process of the present 
invention is set forth hereinafter in Example 4. 

The following Examples illustrate preferred embodiments of the present 
10 invention and are not limiting of the specification and claims in any way. 

Example 1 

Preparation Of Uniformly — N-Labeled Target Molecules 
A. Stromelysin 

15 Human stromelysin is a 447-amino acid protein believed to be involved in 

proteolytic degradation of cartilage. Cartilage proteolysis is believed to result in 
degradative loss of joint cartilage and the resulting impairment of joint function 
observed in both osteoarthritis and rheumatoid arthritis. The protein possesses a 
series of domains including N-terminal latent and propetide domains, a C-terminal 

20 domain homologous with homopexin, and an internal catalytic domain. 

Studies have shown that removal of the N-terminal prosequence of 
approximately eighty amino acids occurs to convert the proenzyme to the 45 kDa 
mature enzyme. Furthermore, studies have shown that the C-terminal homopexin 
homologous domain is not required for proper folding of the catalytic domain or 

25 for interaction with an inhibitor. (See, e.g., A. L Marcy, Biochemistry . 3 0: 6476- 
6483 (1991). Thus, the 81-256 amino acid residue internal segment of stromelysin 
was selected as the protein fragment for use in identifying compounds which bind 
to and have the potential as acting as inhibitors of stromelysin. 

To employ the method of the present invention, it was necessary to prepare 

30 the 81-256 fragment (SEQ ID NO: 1 ) of stromelysin in which the peptide backbone 
was isotopically enriched with and l ^N. This was done by inserting a plasmid 
which coded for the production of the protein fragment into an E. coli strain and 

growing the genetically-modified bacterial strain in a limiting culture medium 

IS 13 
enriched with NH4CI and C-glucose. 
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The isotopically enriched protein fragment was isolated from the culture 
medium, purified, and subsequently used as the basis for evaluating the binding of 
lest compounds. The procedures for these processes are described below. 

Human skin fibroblasts (ATCC No. CRL 1507) were grown and induced 
5 using the procedure described by Clark et aL, Archiv. Biochem, and Biophvs . , 
241: 36-45 (1985). Total RNA was isolated from 1 g of cells using a Promega 
RNAgents® Total RNA Isolation System Kit (Cat.# Z51 10, Promega Corp., 2800 
Woods Hollow Road, Madison, WI 5371 1-5399) following the manufacturers 
instructions. A 1 f4g portion of the RNA was heat-denatured at 80" C for five 
10 minutes and then subjected to reverse transcriptase PGR using a GeneAmp® RNA 
PCR kit (Cat.# N808-0017, Applied Biosystems/Perkin-Elmer, 761 Main Avenue, 
Norwalk, CT 06859-0156) following the manufacturer's instructions. 

Nested PCR was performed using first primers (A) GAAATGAAGAGTC 
TTCAA (SEQ ID NO:3) and (B) GCGTCCCAGGTTCTGGAG (SEQ ID NO:4) 
15 and thirty-five cycles of 94°C, two minutes; 45" C, two minutes; and 72°C three 
minutes. This was followed by reamplification with internal primers (C) 
ATACCATGGCCTATCCAT TGGATGGAGC (SEQ ID NO:5) and (D) 
ATA GG ATCCTTA GGTCTCA GGGGA GTCAGG (SEQ ID NO:6) using thirty 
cycles under the same conditions described immediately above to generate a DNA 
20 coding for amino acid residues 1-256 of human stromelysin. 

The PCR fragment was then cloned into PCR cloning vector pT7Bluc(R) 
(Novagen, Inc., 597 Science Drive, Madison, WI 5371 1) according to the 
manufacturers instructions. The resulting plasmid was cut with Ncol and BamHI 
and the stromelysin fragment was subcloned into-the Novagen expression vector 
25 pET3d (Novagen, Inc., 597 Science Drive, Madison, WI 5371 1), again using the 
manufacturer's instructions. 

A mature stromelysin expression construct coding for amino acid residues 
81-256 plus an initiating methionine was generated from the 1-256 expression 
construct by PCR amplification. The resulting PCR fragment was first cloned into 
30 the Novagen pT7Blue(R) vector and then subcloned into the Novagen pET3d 
vector, using the manufacturer's instructions in the manner described above, to 
produce plasmid (pETST-83-256). This final plasmid is identical to that described 
by Qi-Zhuang et ah, Biochemistry . 3 1: 1 1231-1 1235 ( 1992) with the exception 
that the present codes for a peptide sequence beginning two amino acids earlier, at 
35 position 8 1 in the sequence of human stromelysin. 
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Plasmid pETST-83-256 was transformed into £. coli strain 
BL21(DE3)/pLysS (Novagen, Inc., 597 Science Drive, Madison, WI 5371 1) in 
accordance with the manufacturer's instructions to generate an expression strain, 
BL2 1 (DE3)/pLysS/pETST-255- 1 . 

5 A preculture medium was prepared by dissolving 1.698 g of 

Na2HP4*7H20 t 0.45 g of KH2PO4, 0.075 g NaCl, 0.150 g l ^NH4Cl, 0300 
DC-glucose, 300 pCL of 1M aqueous MgS04 solution and 15 }iL of aqueous 
CaCl2 solution in 150 mLof deionized water. 

The resulting solution of preculture medium was sterilized and transferred 

10 to asterile 500 mL baffle flasL Immediately prior to inoculation of the preculture 
medium with the bacterial strain, 150 f<L of a solution containing 34 mg/mL of 
chloramphenicol in 100% ethanol and 1.5 mL of a solution containing 20 mg/mL 
of ampicillin were added to the flask contents. 

The flask contents were then inoculated with 1 mL of glycerol stock of 

1 5 genetically-modified E. Coli , strain BL2 1 (DE3)/pLysS/pETST-255- 1 . The flask 
contents were shaken (225 rpm) at 37* C until an optical density of 0.65 was 
observed. 

A fermentation nutrient medium was prepared by dissolving 1 13.28 g of 
Na2HP4»7H20, 30 g of KH2PO4, 5 g NaCl and 10 mL of 1% DF-60 antifoam 
20 agent in 9604 mL of deionized water. This solution was placed in a New 

Brunswick Scientific Micros Fermenter (Edison, NJ) and sterilized at 121°C for 40 
minutes. 

Immediately prior to inoculation of the fermentation medium, the following 

pre-sterilized components were added to the fermentation vessel contents: 100 mL 

15 

25 of a 10% aqueous solution of NH4CI, 100 mL of a 10% aqueous solution of 
13 

C-glucose, 20 mL of an aqueous 1M solution of MgS04, 1 mL of an aqueous 
1M CaCl2 solution, 5 mL of an aqueous solution of thiamin hydrochloride ( 10 
mg/mL), 10 mL of a solution containing 34 mg/mL of chloramphenicol in 100% 
ethanol and 1.9 g of ampicillin dissolved in the chloramphenicol solution. The pH 

30 of the resulting solution was adjusted to pH 7.00 by the addition of an aqueous 
solution of 4N H2SO4. 

The preculture of E. Coli , strain BL27(DE3)/pLysS/pETST-255- 1 , from 
the shake-flask scale procedure described above was added to the f ermentor 
contents and cell growth was allowed to proceed until an optical density of 0.48 

35 was achieved. During this process, the fermenter contents were automatically 
maintained at pH 7.0 by the addition of 4N H2SO4 or 
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4N KOH as needed. The dissolved oxygen content of the fermenter contents was 
maintained above 55% air saturation through a cascaded loop which increased 
agitation speed when the dissolved oxygen content dropped below 55%. Air was 
fed to the fermenter contents at 7 standard liters per minute (SLPM) and the culture 
5 temperature was maintained at 37*C throughout the process. 

The cells were harvested by centrifugation at 17,000 x g for 10 minutes at 
4*C and the resulting cell pellets were collected and stored at -85°C. The wet cell 
yield was 3.5 g/L, Analysis of the soluble and insoluble fractions of cell lysates by 
sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) revealed 
10 that approximately 50% of the ^N-stromelysin was found in the soluble phase. 

The isotopically-labeled stromelysin fragment prepared as described above 
was purified employing a modification of the technique described by Yeetal . , 
Biochemistry, 31: 11231-11235(1992). 

The harvested cells were suspended in 20 mM Tris-HCl buffer (pH 8.0) 
15 sodium azide solution containing 1 mM MgCl2, 0.5 mM ZnCl2> 25 units/mL of 
Benzonase® enzyme, and an inhibitor mixture made up of 4-(2-aminoethyl)- 
benzenesulfonyl fluoride ("AEBSF"), Leupeptin®, Aprotinin®, and Pepstatin® 
(all at concentrations of 1 fig/mL. AEBSF, Leupeptin®, Aprotinin®, and 
Pepstatin® are available from American International Chemical, 17 Strathmore 
20 Road, Natick, MA 01760.) 

The resulting mixture was gently stirred for one hour and then cooled to 
4°C. The cells were then sonically disrupted using a 50% duty cycle. The 
resulting lysate was centrifuged at 14,000 rpm for 30 minutes and the pellet of 
insoluble fraction frozen at -80° C for subsequeni processing (see below). 
25 Solid ammonium sulfate was added to the supernatant to the point of 20% 

of saturation and the resulting solution loaded onto a 700 mL phenyl sepharose fast 
flow ("Q-Sepharose FF") column (Pharmacia Biotech., 800 Centennial Ave., P. 
O. Box 1327, Piscataway, NJ 08855). Prior to loading, the sepharose column 
was equilibrated with 50 mM Tris-HCl buffer (pH 7.6 at 4°C), 5 mM CaCl2, and 
30 1 M (NH4)2S04. The loaded column was eluted with a linear gradient of 
decreasing concentrations of aqueous (NH4)2S04 (from 1 down to 0 M) and 
increasing concentrations of aqueous CaCl2 (from 5 to 20 mM) in Tris-HCl buffer 
at pH 7.6. 



35 
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The active fractions of eluate were collected and concentrated in an Amicon 
stirred cell (Amicon, Inc., 72 Cherry Hill Drive, Beverly, MA 01915). The 
concentrated sample was dialyzed overnight in the starting buffer used with the Q- 
Sepharose FF column, 50 mM Tris-HCl (pH 8.2 at 4°C) with 10 mM CaCl2- 

5 The dialyzed sample was then loaded on the Q-Sepharose FF column and 

eluted with a linear gradient comprising the starting buffer and 200 mM NaCl. The 
purified soluble fraction of the isotopically-labeled stromelysin fragment was 
concentrated and stored at 4 W C 

The pellet was solubilized in 8M guanidine-HCI. The solution was 

10 centrifuged for 20 minutes at 20,000 rpm and the supernatant was added dropwise 
to a folding buffer comprising 50 mM Tris-HCl (pH 7.6), 10 mM CaCl2 0.5 mM 
ZnCl2 and the inhibitor cocktail of AEBSF, Leupeptin® , Aprotinin®, and 
Pepstatin® (all at concentrations of 1 //g/mL). The volume of folding buffer was 
ten times that of the supernatant. The mixture of supernatant and folding buffer 

1 5 was centrifuged at 20,000 rpm for 30 minutes. 

The supernatant from this centrifugation was stored at4 tt C and the pellet 
was subjected twice to the steps described above of solubilization in guanidine- 
HCI, refolding in buffer, and centrifugation. The final supernatants from each of 
the three centrifugations were combined and solid ammonium sulfate was added to 

20 the point of 20% saturation. The resulting solution thus derived from the insoluble 
fraction was subjected to purification on phenyl Sepharosc and Q-Sepharose as 
described above for the soluble fraction. 

The purified soluble and insoluble fractions were combined to produce 
about 1 .8 mg of purified isotopically-labeled stromelysin 81-256 fragment per 

25 gram of original cell paste. 

B. Human papillomavirus (HPV) E2 Inhibitors 

The papillomaviruses are a family of small DNA viruses that cause genital 
warts and cervical carcinomas. The E2 protein of HPV regulates viral transcription 

30 and is required for viral replication. Thus, molecules that block the binding of E2 
to DNA may be useful therapeutic agents against HPV. The protein rather than the 
DNA was chosen as a target, because it is expected that agents with greater 
selectivity would be found that bind to the protein rather than the DNA. 

The DNA-binding domain of human papillomavirus E2 was cloned from 

35 the full length DNA that codes for E2 using PGR and overexpressed in bacteria 

using the T7 expression system. Uniformly 15 N-labeled protein was isolated from 
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bacteria grown on a minimal medium containing N-labclcd protein was isolated 

from bacteria grown on a minimal medium containing *~*N-Iabeled ammonium 

chloride. The protein was purified from the bacterial cell lysate using an S- 

sepharose FastFlow column pre-equilibrated with buffer (50 mM Tns, 100 mM 

5 NaCl, 1 mM EDTA, pH = 83). 

The protein was eluted with a linear gradient of 100-500 mM NaCl in 

buffer, pooled, and applied to a Mono-S column at a pH = 7,0. The protein was 

eluted with a salt gradient (100-500 mM), concentrated to 03 mM, and exchanged 

into a TRIS (50 mM, pH = 7.0 buffered H2GD2O (9/1) solution containing 

10 sodium azide (0.5%). 

C RAF 

Uniformly ^N-labeled Ras-binding domain of the RAF protein was 
prepared as described in Emerson et al., Biochemistry . 34 (21): 691 1-6918 
15 (1995). 



D. FKBP 

Uniformly ^N-labeled recombinant human FK binding protein (FKBP) 
was prepared as described in Logan et al., J. Mol. Biol. . 236: 637-648 (1994). 

20 

Example 2 

Screening Compounds Using Two-Pi mensional — N/- H NMR Correlation 
Spectral Analysis 

The catalytic domain of stromelysin was prepared in accordance with the 
25 procedures of Example 1 . The protein solutions used in the screening assay 
contained the uniformly ^N-labeled catalytic domain of stromelysin (03 mM), 
acetohydroxamic acid (500 mM), CaCl2 (20 mM), and sodium azide (0.5%) in a 
H2CO2O (9/1) TRIS buffered solution (50 mM, pH=7.0). 

Two-dimensional *^N/*H NMR spectra were generated at 29*C on a 
30 Broker AMX500 NMR spectrometer equipped with a triple resonance probe and 
Broker sample changer. The *^N/*H HSQC spectra were acquired as 80 x 1024 
complex points using sweep widths of 2000 Hz ( 15 N, t 1 ) and 8333 Hz ( 1 H, t2). 
A delay of 1 second between scans and 8 scans per free induction decay(fid) were 
employed in the data collection. All NMR spectra were processed and analyzed on 
35 Silicon Graphics computers using in-house-written software. 
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A first two-dimensional N/ H NMR correlation spectrum was acquired 
for the ^N-labeled stromelysin target molecule as described above. The 
stromelysin target was then exposed to a database of test compounds. Stock 
solutions of the compounds were made at 100 mM and 1 ML In addition, a 
5 combination library was prepared that contained 8-10 compounds per sample at a 
concentration of 100 mM for each compound. 

The pH of the 1 M stock solution was adjusted with acetic acid and 
ethanolamine so that no pH change was observed upon a 1/10 dilution with a 100 
mM phosphate buffered solution (pH = 7.0). It is important to adjust the pH, 
10 because small changes in pH can alter the chemical shifts of the biomolecules and 
complicate the interpretation of the NMR data. 

The compounds in the database were selected on the basis of size 
(molecular weight = 100-300) and molecular diversity. The molecules in the 
collection had different shapes (e.g., flat aromatic rings(s) > puckered aliphatic 
15 rings(s), straight and branched chain aliphatics with single, double, or triple 
bonds) and diverse functional groups (e.g., carboxylic acids, esters, ethers, 
amines, aldehydes, ketones, and various heterocyclic rings) for maximizing the 
possibility of discovering compound that interact with widely diverse binding sites. 
The NMR samples were prepared by adding 4 y\ of the DMSO stock 
20 solution of the compound mixtures that contained each compound at a 

concentration of 100 mM to 0.4 ml H2OD2O (9/1) buffered solution of the 
uniformly ^N-labeled protein. The final concentration of each of the compounds 
in the NMR sample was about 1 mM. 

In an initial screen, two compounds were found that bind to the catalytic 
25 domain of stromelysin. Both of these compounds contain a biaryl moiety. Based 
on these initial hits, structurally similar compounds were tested against 
stromelysin. The structure of those biaryl compounds is represented by the 
structure 1, below. (See Table 1 for definitions of R1-R3 and A 1-A3). 




In the second round of screening, binding was assayed both in the absence 
and in the presence of saturating amounts of acetohydroxamic acid (500 mM). 
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Many of the biaryl compounds were found to bind the catalytic domain of 

stromelysin. FIG. 4 shows a representative two-dimensional *~W*H NMR 

correlation spectrum before and after exposure of stromelysin to a biaryl test 

compound. It can be seen from FIG. 4 that the compound caused chemical shifts 
15 

of N-si tes such as those designated W 124, T 187, A 199 and G204. 

These sites correspond to a tryptophan (Tip) residue at position 124, a 
threonine (Thr) at position 187, an alanine (Ala) at positionl99, and a glycine 
(Gly) at position 204 of SEQ ID NO. L FIG. 9 shows the correlation between the 
NMR binding data and a view of the NMR-deri ved three-dimensional structure of 
the catalytic domain of stromelysin. The ability to locate the specific binding site of 
^particular ligand is an advantage of the present invention. 

Some compounds only bound to stromelysin in the presence of hydroxamic 
acid. Thus, the binding affinity of some compounds was enhanced in the presence 
of the hydroxamic acid (i. e. cooperative). These results exemplify another 
important capability of the present screening assay: the ability to identify 
compounds that bind to the protein in the presence of other molecules. 

Various biaryl compounds of structure 1 were tested for binding to 
stromelysin at differing concentrations. The 15 N/*H spectra generated at each 
concentration were evaluated to quantify differences in the spectra as a function of 
compound concentration. A binding or dissociation constant (KD)was calculated, 
using standard procedures well known in the art, from those differences. The 
results of this study are shown in Table 1 . The values for R1-R3 and A 1 -A3 in 
Table 1 refer to the corresponding positions in the structure 1, above. 



Table 1 



Compound 
No. 
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0.2 
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8 


OCOCH3 
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0.3 
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OH 


C 


c 
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10 
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0.4 


11 


OH 
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0.3 


12 


OH 
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CN 


C 


c 


c 


0.02 



The data in Table 1 show the utility of a process of the present invention in 
determining dissociation or binding constants between a ligand and a target 
molecule. 

5 * Another advantage of an NMR screening assay of the present invention is 
the ability to correlate observed chemical shifts from the two-dimensional l5 N/*H 
NMR correlation spectra with other spectra or projections of target molecule 
configuration. The results of a representative such correlation are shown in FIG. 
9, which depicts regions within the polypeptide at which binding with the substrate 

10 molecule is most likely occurring. In this Figure, the apparent binding regions in 
stromelysin are shown for Compound 1 (from Table 1). 

Compounds from the database were screened in a similar manner for 
binding to the DNA-binding domain of the E2 protein. Those compounds had the 
structure II below, where R1-R4 and A arc defined in Table 2. 

15 




NMR experiments were performed at 29*C on a Brukef AMX500 NMR 
spectrometer equipped with a triple resonance probe and Bruker sample changer. 
20 The *~*N-/*H HSQC spectra were acquired as 80 x 1024 complex points using 
sweep widths of 2000 Hz ( 15 N,ti) and 8333 Hz ( *H, 12). A delay of 1 second 
between scans and 4 scans per free induction decay were employed in the data 
collection. All NMR spectra were processed and analyzed on Silicon Graphics 
computers. 

25 FIGs. 2 and 3 show representative two-dimensional *^N/*H NMR 

correlation spectra before and after exposure of the DNA-binding domain of E2 to 
a first and second test compound, respectively. 
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It can be seen from FIG. 2 that the first test compound caused chemical 
shifts of ^N-sites such as those designated 1 15, Y21 , R22 and L23. Those sites 
correspond to an isoleucine (lie) residue at position 15, a tyrosine residue (Tyr) at 
position 21, an arginine (Arg residue at position 22 and a leucine (Leu) residue at 
5 position 23 of SEQ ID NO. 6. 

It can be seen from FIG. 3 that the second test compound caused chemical 
shifts in the particular l ~*N-$Ues designated 16, Gl 1, H38, and T52. Those sites 
correspond to an isoleucine (He) residue at position 6, a glycine (Gly) residue at 
position 1 1, a histidine (His) residue at position 38 and a threonine (Thr) at 
10 position 52 of SEQ ID NO. 6. 

FIGs. 7 and 8 show the correlation between those NMR binding data and a 
view of the NMR-derived three-dimensional structure of the DNA-binding domain 
of E2. 

Several structurally similar compounds caused chemical shift changes of 
15 the protein signals when screened at a concentration of I mM Two distinct sets of 
amide resonances were found to change upon the addition of the compounds: one 
set of signals corresponding to amides located in the 6-barrel formed between the 
two monomers and a second set corresponding to amides located near the DNA- 
binding site. 

20 For example, compounds containing two phenyl rings with a carboxylic 

acid attached to the carbon Unking the two rings only caused chemical shift changes 
to the amides in the DNA-binding site. In contrast, benzophenones and 
phenoxyphenyl-containing compounds only bound to the B-barrel. Other 
compounds caused chemical shift changes of both sets of signals but shifted the 

25 signals in each set by different amounts, suggesting the presence of two distinct 
binding sites. 

By monitoring the chemical shift changes as a function of ligand 
concentration, binding constants for the two binding sites were also measured. 
The results of those studies are summarized below in Table 2. 



WO 97/18471 



PCT/US96/I8270 



26 
Table 2 



Comp. 
No. 


A 


*M 






I? A 


L/risx 

KrXmM) 


Ko(mM) 


Filter 
binding 
flsssy 


13 


CO 


H 


H 


H 


OH 


>50 


0,6 




14 


o 


H 


H 


H 


CH 2 OH 


>50 


2.0 


- 


15 


_a 


H 


H 


COO 


H 


2.0 


>50 


+ 


16 


_a 


a 


CI 


COO 


H 


0.1 


>50 




17 


_a 


H 


H 


CH2COO 


H 


4.2 


4.9 




18 




H 


H 


CH=CHCOO 


H 


1.2 


6.2 


+ 


19 


O 


H 


H 


CH 2 CH 2 CH(CH3) 
*CH 2 COO 


H 


0.5 


0.2 


+ 


20 


0 


H 


H 


COCH2CH2COO 


H 


2.7 


4.8 





a a dash (-) for A indicates no atom (i.e. byphenyl linkage) 



5 Uniformly N-labeled Ras-binding domain of the RAF protein was 

prepared as described in Example 1 and screened using two-dimensional *^N/*H 
NMR correlation spectral analysis in accordance with the NMR procedures 
described above. The results of a representative study are shown in FIG. 5 T which 
depicts two-dimensional *^N/*H NMR correlation spectra both before and after 

10 exposure to a test compound. 

Uniformly ^N-labeled FKBP was prepared as described in Example 1 and 
screened using two-dimensional *~W*H NMR correlation spectral analysts in 
accordance with the NMR procedures described above. The results of a 
representative study are shown in FIG. 6 ( which depicts two-dimensional *~*N/ l H 

15 NMR correlation spectra both before and after exposure to a test compound. 

Example 3 

Comparison of NMR, Enzymatic. Filter Binding and Gel Shift Screening Assays 
Studies were performed to compare binding constants of ligands to various 
20 biomolecules, determined by the NMR method of the present invention, to similar 
results obtained from prior art methods. 

In a first study, binding constants were determined, both by the NMR 
method of the present invention, and by a prior art enzymatic assay. The target 
molecule was the catalytic domain of stromelysin prepared in accordance with the 
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procedures of Example 1. The NMR binding constants, Kd, were derived using 
two-dimensional *^N/*H NMR correlation spectroscopy as described in Example 
2. The Kd values so obtained were compared to an inhibition constant K\ as 
determined in an enzymatic assay. 

5 The enzymatic assay measured the rate of cleavage of a fluorogenic 

substrate by following the fluorescence increase upon peptide cleavage which 
causes a separation between the fluorophore and quencher. Enzymatic activity was 
measured using a matrix of different concentrations of acetohydroxamic acid and 
biaryl compounds. The assay is a modification of the method described by H. 

10 Weingarten, et al in Anal, Biochenu 1 4 7: 437-440 (1985) employing the 

fluorogenic substrate properties described by E. Matayoshi, et ai in Science : 247: 
954-958(1990). 

Eight acetohydroxamic acid concentrations were used ranging from 0.0 to 
1 .0 M, and six compound concentrations were used, resulting in a total of 48 
15 points. Individual compound concentration varied due to solubility and potency. 
All NMR measurements were performed in the presence of 500 mM 
acetohydroxamic acid, except for the titration of acetohydroxamic acid itself. 
Dissociation constants were obtained from the dependence of the observed 
chemical shift changes upon added ligand. Inhibition constants were then obtained 
20 from the inhibition data using standard procedures. 

The results of these studies are summarized below in Table 3, which shows 
the comparison of NMR-derived dissociation constants (Kd) with inhibition 
constants measured in the enzyme assay (Kj) using a fluorogenic substrate. 

25 Table 3 



Compound 
No. 


NMR Kd 
(mM) 


Assay Ki 
(mM) 


4 


1.6 


7.4 


7 


0.17 


0.32 


9 


0.16 


0.70 


10 


0.40 


1.8 


12 


0.02 


0.11 


Acetohydroxamic acid 


17.0 


21.1 
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The data in Table 3 show that a NMR process of the present invention 
provides a rapid, efficient and accurate way of determining dissociation or binding 
constants of ligands to target biomolecules. Comparison of the binding constants 
determined by the two methods result in the same ranking of potencies of the 
5 compounds tested. That is, while the values for a given substrate as determined by 
the two methods are not equal, they are proportional to one another. 

In a second study, the results for binding of the DNA-binding domain of 
E2 to its target DNA were obtained by prior art methods and compared with results 
obtained by the method of the present invention. The target was the DNA-binding 
10 domain of E2, prepared in accordance with the procedures of Example 1 . NMR 
screening assays and NMR processes for determining ligand dissociation constants 
were performed as set forth above in Example 2. 

The binding constant from the NMR process was compared to the results 

of a physical, filter binding assay that measured binding of DNA to the target. The 

15 high-throughput filter binding assay was performed using E2, prepared according 

33 

to Example 2 above. The P-Iabeled DNA construct comprised a 10329 base 
pair plasmid formed by inserting the HPV-1 1 genome, containing three high 
affinity and one low affinity E2 binding sites, into the PSP-65 plasmid (Promega, 
Madison, Wl). 

20 The binding affinities at the different sites as determined by NMR were 

compared for a subset of the compounds to the inhibition of E2 binding to DNA as 
measured in the filter binding assay. As shown in Table 2 above, the activities 
determined in the filter binding assay correlated closely with the binding affinities 
calculated from the amides of the DNA-binding site but not to the affinities 

25 measured for the B-barrel site. This is consistent with the relative locations of each 
site. 

In an alternative study, a comparison of the NMR-determined binding 
results was made with similar results obtained by a prior art gel-shift assay using 
techniques well known in the art. The gel-shift assay was performed using a GST 

30 fusion protein which contained full length E2 and a 33 P-labeled 62 base pair DNA 
fragment containing two E2 binding sites. 

The method identified numerous compounds which gave positive results in 
the gel-shift assay. Some of these positive results, however, were believed to be 
due to binding to the DNA, since in these cases, no binding to the E2 protein was 

35 observed using the NMR method of this invention. These compounds were 
shown to indeed bind to DNA rather than to E2, as evidenced by changes in the 
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chemical shifts of the DNA rather than the protein upon the addition of the 
compounds. These data show that yet another advantage of the present invention 
is the ability to minimize the occurrence of false positives. 

5 Example 4 

Design of a potent non-peptide inhibitor of stromelvsin 

Studies were performed to design new ligands that bound to the catalytic 
domain of stromelysin. Because siromelysin undergoes autolysis, an inhibitor was 
sought to block the degradation of stromelysin. That inhibitor would facilitate the 
10 screening of other potential ligands that bind to other sites on the enzyme. 

The criteria used in selecting compounds in the screening for other binding 
sites was based primarily on the size of the ligand. The smallest ligand was sought 
that had enough solubility to saturate (>98% occupancy of enzyme) and inhibit the 
enzyme. 

15 The cloning, expression, and purification of the catalytic domain of 

stromelysin was accomplished using the procedures set forth in Example 1. An 
initial step in the design of the new ligand was the identification of a first ligand 
that bound to the stromelysin target Such identification was carried out in 
accordance with a two-dimensional *~W*H NMR correlation screening process as 

20 disclosed above. 

A variety of hydroxamic acids of the general formula R-(CO)NHOH were 
screened for binding to stromelysin using the procedures set forth in Example 2. 
Of the compounds tested, acetohydroxamic acid [CH3(CO)NHOH] best satisfied 
the selection criteria: it had a binding affinity for stromelysin of 17 mM and had 

25 good water solubility. At a concentration of 500 mM, acetohydroxamic acid 

inhibited the degradation of the enzyme, allowing the screening of other potential 
ligands. 

The second step in the design process was the identification of a second 
ligand that bound to the target stromelysin at a site different from the binding site of 
30 acetohydroxamic acid. This was accomplished by screening compounds for their 
ability to bind stromelysin in the presence of saturating amounts of 
acetohydroxamic acid. Details of procedures' and results of this second 
identification step are set forth above in Example 2. 

The compound identified as a second ligand from these studies and used in 
35 subsequent design steps was the compound designated as Compound #4 in Table 1 
(See Example 2). 
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The next step in the design process was to construct a ternary complex of 
the target stromelysin, the first ligand and the second ligand. This was 
accomplished by exposing the stromelysin target to the two ligands under 
conditions that resulted in complex formation. The three-dimensional structure of 
5 the ternary complex was then determined using NMR spectroscopy as described 
below. 

The *H, ^C, and backbone resonances of stromelysin in the ternary 
complex were assigned from an analysis of several 3D double- and triple- 
resonance NMR spectra (A. Bax. et aL Acc. Chem. Res. . 26: 131-138 (1993)). 

10 The C a resonances of adjacent spin systems were identified from an analysis of 
three-dimensional (3D) HNCA (L.Kay etal. f J. Magn. Rcson. . 89: 496-514 
(1990)) and HN(CO)CA (A. Bax, et ai t J. Bio. NMR . 1 : 99 (1991)) spectra 
recorded with identical spectral widths of 1773 Hz (35.0 ppm), 3788 Hz (30. 1 
ppm), and 8333 Hz (16.67 ppm) in the Fi( 15 N), F2( 13 C) and F3(*H) 

15 dimensions, respectively. 

The data matrix was 38(ti) x 48(t2) x 1024(t3) complex points for the 
HNCA spectrum, and 32(ti) x 40(t2) x 1024(t3) complex points for the 
HN(CO)CA spectrum. Both spectra were acquired with 16 scans per increment. A 
3D CBCA(CO)NH spectrum (S. Grzesiek, et ai 7 J. Am. Chem. Soc. . 114: 6261- 

20 6293 (1992)) was collected with 32(ti, 15 N) x48(t2, 13 C) x 1024(t3, ! H) 

complex points and 32 scans per increment. Spectral widths were 1773 Hz (35.0 
ppm), 7575.8 Hz (60.2 ppm), and 8333 Hz (16.67 ppm) in the 15 N, 13 C and l H 
dimensions, respectively. 

For all three spectra, the *H carrier frequency was set on the water 

15 13 
25 resonance and the N carrier frequency was at 1 19. 1 ppm. The C earner 

frequency was set to 55.0 ppm in HNCA and HN(CO)CA experiments, and 46.0 

ppm in the CBCA(CO)NH experiment. 

The backbone assignments were confirmed from an analysis of the 

crosspeaks observed in an 15 N-separated 3D NOESY-HSQC spectrum and a 3D 

30 HNHA-J spectrum. The 15 N-separated 3D NOESY-HSQC spectrum (S. Fcsik, 

etaU J. Maen. Reson. . 87: 588-593 (1988)); D. Marion, etal % J. Am. Chem. 

Soc. Ill: 1515-1517 (1989)) was collected with a mixing time of 80 ms. A total 

of68(ti l 15 N)x96(t2, l H)x 1024{t3, *H) complex points with 16scansper 



increment were collected, and the spectral widths were 1773 Hz (35.0 ppm) for the 
If 

1, 



35 15 N dimension, 6666.6 Hz (t2, 1 H, 133 ppm), and 8333 Hz (16.7 ppm) for the 



H dimension. 
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The 3D HNHA-J spectrum (G. Vuister, et a/., J. Am. Chem. Soc. . 1 1 5: 

7772-7777 ( 1993)), which was also used to obtain JHNHa coupling constants, 

was acquired with 35(1], 15 N)x64(t2, ] H)xl024(t3, l H) complex points and 

32 scans per increment. Spectral widths and carrier frequencies were identical to 

5 those of the 15 N-separated NOESY-HSQC spectrum. Several of the H B signals 

were assigned using the HNHB experiment. The sweep widths were the same as 

in the 15 N-separated NOESY-HSQC spectrum that was acquired with 32(ti, 15 N) 

x 96(t2, *H) x 1024 (t3, *H) complex points. 
1 13 

The H and C chemical shifts were assigned for nearly all sidechain 
10 resonances. A 3D HCCH-TOCSY spectrum (L. Kay, et ai 9 J, Magn. Reson. 
101b: 333-337 ( 1993)) was acquired with a mixing time of 13 ms using the 
DIPSI-2 sequence (S. Rucker, et al y Mol. Phvs. . 68: 509 (1989)) for 13 C 
isotropic mixing. A total of 96 (ti, 13 C)x96(t2, *H) x 1024(13, ^complex 
data points were collected with 16 scans per increment using a spectral width of 
15 10638 Hz (70.8 ppm, wi), 4000 Hz (6.67 ppm, w 2 ), and 4844 (8.07 ppm, W3). 
Carrier positions were 40 ppm, 2.5 ppm, and at the water frequency for the 13 C, 
indirectly detected *H, and observed *H dimensions, respectively. 

Another 3D HCCH-TOCSY study was performed with the 13 C carrier at 
122.5 ppm to assign the aromatic residues. The spectra were collected with 
20 36(ti, C) x 48(t2, *H) x 1024 (t3, l U) complex points with spectral widths of 
5263 Hz (35.0 ppm, wi), 3180 Hz (530 ppm, w 2 ), and 10,000 (16.7 ppm, w 3 ). 

Carrier positions were 122.5 ppm, 7.5 ppm, and at the water frequency for the 

13 11 
C, indirectly detected H, and observed H dimensions, respectively. 

A 13 C-separated 3D NOESY-HMQC spectrum (S. Fesik, et a/., J. Magn. 

25 Resoru, 87: 588-593 (1988)); D.Marion, etaL J. Am. Chem. Soc. . Ill: 1515- 

1517(1989)) was recorded using a mixing time of 75 ms. A total of 80 (ti, 13 C) 

x 72 (t2, *H) x 1024 (t3, *H) complex data points with 16 scans per increment 

were collected over spectral widths of 10638 Hz (70.49 ppm, wj), 6666.6 Hz 

(13.3 ppm, W2)> and 83333 Hz (16.67 ppm, W3). The ^carrier frequencies 

13 

30 were set to the water resonance, and the C carrier frequency was placed at 40.0 
ppm. 

Stereospecific assignments of methyl groups of the valine and leucine 
residues were obtained by using a biosynthetic approach (Neri et a/., Biochem. . 
28: 7510-7516 (1989)) on the basis of the 13 C- 13 C one-bond coupling pattern 
35 observed in a high-resolution 1 H, 13 C-HSQC spectrum (G. Bodenhausen, et a/., 
J. Chem. Phvs. Lett. . 69: 185-189 (1980)) of a fractionally 13 C-labeled protein 
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sample. The spectrum was acquired with 200( C, i\) x 2048( H, t2) complex 

points over spectral widths of 5000 Hz (39.8 ppm, 13 C) and 8333 Hz (16.7 ppm, 
1 13 

H). Carrier positions were 20.0 ppm for the C dimension, and at the water 

frequency f or the H dimension. 1 2 

5 To detect NOEs between the two ligands and the protein, a 3D C- 

13 

filtered, Credited NOESY spectrum was collected. The pulse scheme consisted 
13 

of a double C-filter sequence (A. Gemmeker, et al, J. Magn. Reson. . 9 6: 199- 
204 (1992)) concatenated with a NOESY-HMQC sequence (S. Fesik, et al f ^ 
Maen. Reson. » 87: 588-593 (1988)); D. Marion, etai, J. Am. Chem. Soc. . 1 1 1: 

10 1515-1517 (1989)) . The spectrum was recorded with a mixing time of 80 ms, 
and a total of 80 (ti, 13 C)x80(t2, 1 H)xl024(t3, *H) complex points with 16 
scans per increment. Spectral widths were 8865 Hz ( 17.73 ppm, wi), 6667 Hz 
(1333 ppm, W2), and 8333 Hz (16.67 ppm, W3), and the carrier positions were 
40.0 ppm for the carbon dimension and at the water frequency for both proton 

15 dimensions. 

To identify amide groups that exchanged slowly with the solvent, a series 
of 1 H, 15 N-HSQC spectra (G. Bodcnhauscn, et a/., J. Chem. Phys. Lett. , 69: 
185-189 (1980)) were recorded at 25°C at 2 hr intervals after the protein was 
exchanged into D2O. The acquisition of the first HSQC spectrum was started 2 
20 hrs. after the addition of D2O. 

All NMR spectra were recorded at 25'C on a Bruker AMX500 or A MX 600 
NMR spectrometer. The NMR data were processed and analyzed on Silicon 
Graphics computers. In all NMR experiments, pulsed field gradients were applied 
where appropriate as described [A. Bax, etai, J. Magn. Reson. . 99; 638 (1992)) 
25 to afford the suppression of the solvent signal and spectral artifacts. Quadrature 
detection in indirectly detected dimensions was accomplished by using the States- 
TPPI method (D. Marion, et a/., J. Am. Chem. Soc. , 1 1 1: 1515-1517 (1989)). 
Linear prediction was employed as described (E. Olejniczak, et a/., J. Magn. 
Reson. . 8 7: 628-632 (1990)). 
30 The derived three-dimensional structure of the ternary complex was then 

used to define the spatial orientation of the first and second ligands to each other as 
well as to the target stromelysin molecule. 

Distance restraints derived from the NOE data were classified into six 
categories based on the NOE cross peak intensity and given a lower bound of 1 .8 
35 A and upper bounds of 2.5 A, 3.0 A, 3.5 A, 4.0 A, 4.5 A, and 5.0 A, 

respectively. Restraints for <j> torsional angles were derived from 3 JHNHct 
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coupling constants measured from the 3D HNHA-J spectrum (G. Vuister, et a/., L_ 
Am. Chem. Soc. . 1 1 5: 7772-7777 (1993)). The angle was restrained to 
120%±40% for 3 JHNHa > 8.5 Hz, and 60%±40% for 3 JHNHa < 5 Hz. 

Hydrogen bonds, identified for slowly exchanging amides based on initial 

5 structures, were defined by two restraints: 1 .8-2.5 A for the H-O distance and 1.8- 
3.3 A for the N-O distance. Structures were calculated with the X-PLOR 3. 1 
program (A. Brlinger, "XPLOR 3.1 Manual/ Yale University Press, New Haven, 
1992) on Silicon Graphics computers using a hybrid distance geometry-simulated 
annealing approach (M Nilges, et aL, FEBS Lett. . 2 29: 317-324 (1988)). 

10 A total of 1032 approximate interproton distance restraints were derived 

*from the NOE data In addition, 21 unambiguous intermolecular distance restraints 
were derived from a 3D 12C-filtered, 13C-edited NOESY spectrum. Of the 1032 
NOE restraints involving the protein, 341 were intra-residue, 410 were sequential 
or short-range between residues separated in the primary sequence by less than five 

15 amino acids, and 281 were long-range involving residues separated by at least five 
residues. 

In addition to the NOE distance restraints, 14 <j> dihedral angle restraints 

were included in the structure calculations that were derived from three-bond 

3 

coupling constants ( JHNHa) determined from an HNHA-J spectrum (G. 
20 Viioster, etol., J. Am. Chem. Soc. . 1 1 5: 7772-7777 ( 1993)). The experimental 
restraints also included 120 distance restraints corresponding to 60 hydrogen 
bonds. The amides involved in hydrogen bonds were identified based on their 
characteristically slow exchange rate, and the hydrogen bond partners from initial 
NMR structures calculated without the hydrogen bond restraints. The total number 
25 of non-redundant, experimentally-derived restraints was 1 166. 

The structures were in excellent agreement with the NMR experimental 
restraints. There were no distance violations greater than 0.4 A, and no dihedral 
angle violations greater than 5 degrees. In addition, the simulated energy for the 
van der Waals repulsion term was small, indicating that the structures were devoid 
30 of bad inter-atomic contacts. 

The NMR structures also exhibited good covalent bond geometry, as 
indicated by small bond-length and bond-angle deviations from the corresponding 
idealized parameters. The average atomic root mean square deviation of the 8 
structures for residues 93-247 from the mean coordinates was 0.93 A for backbone 
35 atoms (C*. N, and C), and 1.43 A for all non-hydrogen atoms. 
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A ribbon plot of the ternary complex involving stromelysin, 
acetohydroxamic acid (the first ligand), and the second ligand is shown in Fig 10. 
The structure is very similar to the global fold of other matrix metalloproteinases 
and consists of a five-stranded B-sheet and three a-helices. 

5 The catalytic zinc was located in the binding pocket. It was coordinated to 

three histidines and the two oxygen atom of acetohydroxamic acid. A biaryl group 
of the second ligand was located in the ST pocket between the second helix and 
the loop formed from residues 218-223. This deep and narrow pocket is lined 
with hydrophobic residues which make favorable contacts with the ligand. 

10 Based on the three-dimensional structure of the ternary complex as 

determined above and the structure/activity relationships observed for the binding 
to stromelysin of structural analogs of the second ligand (i.e., other biaryl 
compounds), new molecules were designed that linked together the 
acetohydroxamic acid to biaryls. 

15 As shown in Table 4 below, the initial biaryls chosen contained an oxygen 

linker and the absence or presence of CN para to the biaryl linkage. Initial linkers 
contained varying lengths of methylene units. Means for linking compounds with 
linkers having varying lengths of methylene units are well known in the arl. 

20 Table 4 

H 




Compound 


X 


R 


Stromelysin 
Inhibition 


21 


(CH 2 ) 2 


H 


0.31 


22 


(CH 2 ) 3 


H 




23 


(CH 2 ) 4 


H 


38%@100/<M 
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24 


(CH 2 ) 5 


H 


43%@100/*M 


25 


(CH 2 ) 2 


CN 


0.025 iaM 


26 


(CH 2 ) 3 


CN 


3.4 


27 


(CH 2 ) 4 


CN 


3.5 nM 


28 


(CH 2 ) 5 


CN 


1.7 /^M 



As expected based on the better binding of the CN substituted biaryls to 
stromelysin, the CN derivatives exhibited better stromelysin inhibition. The 
compound that exhibited the best inhibition of stromelysin contained a linker with 
5 two methylene units. 

The present invention has been described with reference to preferred 
embodiments. Those embodiments are not limiting of the claims and specification 
in any way. One of ordinary skill in the art can readily envision changes, 
modifications and alterations to those embodiments that do not depart from the 
10 scope and spirit of the present invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

5 

(i) APPLICANT: Fesik, Stephen W. 

Hajduk, Philip j. 

«« » TITLE 0F INVE NTION: Use of Nuclear Magnetic 

10 Resonance to y ^ 

Identifyify Ligands to Target Biomolecules 
(iii) NUMBER OF SEQUENCES: 6 

15 . (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Steven F. Weinstock, Dept. 377 

AP6D, Abbott ^ 
Laboratories 

(B) STREET: 100 Abbott Park Road 
20 (C) CITY: Abbott Park 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP: 60064-3500 

25 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version 
M #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

35 (C) CLASSIFICATION: 

(Viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Janssen, Jerry F. 

(B) REGISTRATION NUMBER: 29,175 

40 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (708) 937-4558 

(B) TELEFAX: (708) 938-7742 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



45 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

5 

Phe Arg Thr Phe Pro Gly lie Pro Lys Trp Arg Lys Thr 
1 5 10 

His Leu Thr Tyr Arg lie Val Asn Tyr Thr Pro Asp Leu 
10 15 20 25 

Pro Lys Asp Ala Val Asp Ser Ala Val Glu Lys Ala Leu 
30 35 

15 Lys Val Trp Glu Glu Val Thr Pro Leu Thr Phe Ser Arg 

40 45 50 

Leu Tyr Glu Gly Glu Ala Asp lie Met lie Ser Phe 
55 60 

20 

Ala Val Arg Glu His Gly Asp Phe Tyr Pro Phe Asp Gly 
65 70 75 

Pro Gly Asn Val Leu Ala His Ala Tyr Ala Pro Gly Pro 
25 80 85 90 

Gly lie Asn Gly Asp Ala His Phe Asp Asp Asp Glu Gin 

95 100 

30 Trp Thr Lys Asp Thr Thr Gly Thr Asn Leu Phe Leu Val 

105 110 us 



35 



Ala Ala His Glu lie Gly His Ser Leu Gly Leu Phe 
120 125 

His Ser Ala Asn Thr Glu Ala Leu Met Tyr Pro Leu Tyr 
130 135 * 140 



His Ser Leu Thr Asp Leu Thr Arg Phe Arg Leu Ser Gin 
40 145 150 

Asp Asp lie Asn Gly lie Gin Ser Leu Tyr Gly Pro Pro 
155 160 165 

45 Pro Asp Ser Pro Glu Thr Pro 

170 



50 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Thr Thr Pro lie lie His Leu Lys Gly Asp Ala 
15 5 10 

Asn lie Leu Leu Cys Leu Arg Tyr Arg Leu Ser Lys Tyr 
15 20 25 

20 Lys Gin Leu Tyr Glu Gin Val Ser Ser Thr Trp His Trp 

30 35 



25 



35 



45 



Thr Cys Thr Asp Gly Lys His Lys Asn Ala lie val Thr 
40 45 50 

Leu Thr Tyr lie Ser Thr Ser Gin Arg Asp Asp Phe Leu 
55 60 65 



Asn Thr Val Lys lie Pro Asn Thr Val Ser Val Ser Thr 
30 70 75 

Gly Tyr Met Thr lie 
80 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAAATGAAGA GTCTTCAA 18 



50 



WO 97/18471 



39 



PCT/US96/I8270 



10 



15 



35 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCGTCCCAGG TTCTGGAG 18 



(2) INFORMATION FOR SEQ ID NO: 5: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
30 ATACCATGGC CTATCCATTG GATGGAGC 28 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

45 

ATAGGATCCT TAGGTCTCAG GGGAGTCAGG 30 
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WHAT IS CLAIMED IS: 

1 . A process of screening compounds lo identify compounds that arc ligands 
that bind to a specific target molecule comprising the steps of: 

a) generating a first twodimensionai I5 N/*H NMR correlation 
spectrum of a 15 N-Iabeled target molecule; 

b) exposing the labeled target molecule to one or a mixture of chemical 
compounds; 

c) generating a second two-dimensional 15 N/*H NMR correlation 
spectrum of the labeled target molecule that has been exposed to one 
or a mixture of compounds in step (b); and 

d) comparing said first and second two-dimensional I5 N/*H NMR 
correlation spectra to determine differences between said first and 
said second spectra, the differences identifying the presence of one 
or more compounds that are ligands which have bound to the target 
molecule. 

2 . The process of claim 1 wherein the 15 N-labeIed target molecule is 
exposed to a mixture of chemical compounds in step (b), further 
comprising the steps subsequent to step d) of 

e) exposing the 15 N-labeled target molecule individually to 
each compound of said mixture, 

0 generating a two-dimensional 15 N/*H NMR correlation 
spectrum of the labeled target molecule that has been 
individually exposed to each compound; and 

g) comparing each spectrum generated in step 0 to said first 
spectrum to determine differences in any of those compared 
spectra, the differences identifying the presence of a 
compound that is a ligand which has bound to the target 
molecule. 

3 . The process of claim 1 wherein the differences in the two-dimensional 

N/ H NMR correlation spectra are chemical shifts at particular 15 N- 
labeled sites in the target molecule and chemical shifts in protons attached to 
those ^N-labeled sites. 
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4. The process of claim 1 wherein the target molecule is a polypeptide. 

5. A process of determining the dissociation constant between a target 
molecule and a ligand that binds to that target molecule comprising the steps 

of: 

a) generating a first two-dimensional ^N/ * H NMR correlation 
spectrum of a ^N-labeled target molecule; 

b) exposing the labeled target molecule to various concentrations of a 
ligand; 

c) generating a two-dimensional * ^N/ * H NMR correlation spectrum at 
each concentration of ligand from step (b); 

d) comparing each spectrum from step (c) both to the first spectrum 
from step (a) and to all other spectra from step (c) to quantify 
differences in those spectra as a function of changes in ligand 
concentration; and 

e) calculating the dissociation constant between the target molecule and 
the ligand from those differences according to the equation: 

K D = ([P] 0 -x)([L] 0 -x) 
x 

where [P]o is the total molar concentration of target 
molecule; 

[L]o is the total molar concentration of ligand; and 
x is the molar concentration of the bound species determined 
according to the equation: 

6 obs " fyree 

A — 1 

A 

where 6 0 b s and 6foe are the chemical shift values for the 

target molecule determined at each concentration of 
ligand and for the target molecule in the absence of 
Hgand, respective! y, and A is the difference between 
the chemical shift at saturating amounts of ligand 
and 6^. 



WO 97/18471 



PCT/US96/I8270 



42 



The process of claim 5 wherein the target molecule is a polypeptide. 

The process of claim 5 further comprising the step of binding the labeled 
target molecule to a second ligand before step (a). 
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Use of Nuclear Magnetic Resonance to Identify 
Ligands to Target Biomolecules 

Technical Field of the Invention 
5 The present invention pertains to a method for the screening of compounds 

for biological activity and to the determination of binding dissociation constants 
using two-dimensional ^N/'h NMR correlation spectral analysis to identify and 
design ligands that bind to a target biomolecule. 

10 Background of the Invention 

One of the most powerful tools for discovering new drug leads is random 

streening of synthetic chemical and natural product databases to discover 

compounds that bind to a particular target molecule (i.e., the identification of 

ligands of that target). Using this method, ligands may be identified by their 
15 abUity to form a physical association with a target molecule or by their ability to 

alter a function of a target molecule. 

When physical binding is sought, a target molecule is typically exposed to 

one or more compounds suspected of being ligands and assays are performed to 

determine if complexes between the target molecule and one or more of those 
20 compounds are formed. Such assays, as is well known in the art, test for gross 

changes in the target molecule (e.g., changes in size, charge, mobility) that indicate 

complex formation. 

Where functional changes are measured, assay conditions are established 

that allow for measurement of a biological or chemical event related to the target 
25 molecule (e.g., enzyme catalyzed reaction, receptor-mediated enzyme activation). 

To identify an alteration, the function of the target molecule is determined before 

and after exposure to the test compounds. 

Existing physical and functional assays have been used successfully to 

identify new drug leads for use in designing therapeutic compounds. There are, 
30 however, limitations inherent to those assays that compromise their accuracy, 

reliability and efficiency. 

A major shortcoming of existing assays relates to the problem of "false 

positives". In a typical functional assay, a "false positive" is a compound that 

triggers the assay but which compound is not effective in eliciting the desired 
35 physiological response. In a typical physical assay, a "false positive" is a 

compound that, for example, attaches itself to the target but in a non-specific 
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manner (e.g., non-specific binding). False positives are particularly prevalent and 
problematic when screening higher concentrations of putative ligands because 
many compounds have non-specific affects at those concentrations. 

In a similar fashion, existing assays are plagued by the problem of "false 
5 negatives", which result when a compound gives a negative response in the assay 
but which compound is actually a ligand for the target. False negatives typically 
occur in assays that use concentrations of test compounds that are either too high 
(resulting in toxicity) or too low relative to the binding or dissociation constant of 
the compound to the target. 

10 Another major shortcoming of existing assays is the limited amount of 

information provided by the assay itself. While the assay may correctly identify 
compounds that attach to or elicit a response from the target molecule, those assays 
typically do not provide any information about either specific binding sites on the 
target molecule or structure activity relationships between the compound being 

1 5 tested and the target molecule. The inability to provide any such information is 
particularly problematic where the screening assay is being used to identify leads 
for further study. 

It has recently been suggested that X-ray crystallography can be used to 
identify the binding sites of organic solvents on macromolecules. However, this 

20 method cannot determine the relative binding affinities at different sites on the 

target. It is only applicable to very stable target proteins that do not denature in the 
presence of high concentrations of organic solvents. Moreover, this approach is 
not a screening method for rapidly testing many compounds that are chemically 
diverse, but is limited to mapping the binding sites of only a few organic solvents 

25 due to the long time needed to determine the individual crystal structures. 

Compounds are screened to identify leads that can be used in the design of 
new drugs that alter the function of the target biomolecule. Those new drugs can 
be structural analogs of identified leads or can be conjugates of one or more such 
lead compounds. Because of the problems inherent to existing screening methods, 

30 those methods are often of little help in designing new drugs. 

There continues to be a need to provide new, rapid, efficient, accurate and 
reliable means of screening compounds to identify and design ligands that 
specifically bind to a particular target 
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Brief Summary of the Invention 
In one aspect, the present invention provides a process of screening 
compounds for biological activity to identify ligands that bind to a specific target 
5 molecule. That process comprises the steps of: a) generating a first two- 
dimensional *^N/'h NMR correlation spectrum of a ^N-labeled target molecule; 
b) exposing the labeled target molecule to one or a mixture of chemical 
compounds; c) generating a second two-dimensional ^N/'h NMR correlation 
spectrum of the labeled target molecule that has been exposed to one or a mixture 

10 of compounds in step (b); and d) comparing said first and second two-dimensional 
^N/' H NMR correlation spectra to determine differences between said first and 
said second spectra, the differences identifying the presence of one or more 
compounds that are ligands which have bound to the target molecule. 

Where the process of the present invention screens more than one 

15 compound in step (b), that is, a mixture of compounds, and where a difference 
between the first spectrum generated from the target molecule alone and that 
generated from the target molecule in the presence of the mixture, additional steps 
are performed to identify which specific compound or compounds contained in the 
mixture is binding to the target molecule. Those additional steps comprise the 

20 steps of e) exposing the l5 N-labeled target molecule individually to each 
compound of the mixture, f) generating a two-dimensional '^N/*H NMR 
correlation spectrum of the labeled target molecule that has been individually 
exposed to each compound; and g) comparing each spectrum generated in step f) to 
the first spectrum generated from the target molecule alone to determine differences 

25 in any of those compared spectra, the differences identifying the presence of a 
compound that is a ligand which has bound to the target molecule. 

Because the chemical shift values of the particular *^N/*H signals in the 
two-dimensional correlation spectrum correspond to known specific locations of 
atomic groupings in the target molecule (e.g., the N-H atoms of the amide or 

30 peptide link of a particular amino acid residue in a polypeptide), the process of the 
present invention allows not only for the for identification of which compound(s) 
bind to a particular target molecule, but also permit the determination of the 
particular binding site of the ligand on the target molecule. 

In a second aspect, the present invention provides a process of determining 

35 the dissociation constant, Kd, for a given ligand and its target molecule. That 

process comprises the steps of a) generating a first two-dimensional 15 N/*H NMR 
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correlation spectrum of a 15 N-labeled target molecule; b) exposing the labeled 
target motecule to various concentrations of a ligand; c) generating a two- 
dimensional ^N/* H NMR correlation spectrum at each concentration of ligand in 
step (b); d) comparing each spectrum from step (c) to the first spectrum from step 
(a); and e) calculating the dissociation constant between the target molecule and the 
ligand from those differences according to the equation: 

K D = ([Pj 0 -x)([L) o -x) 
x 

An advantageous aspect of the present invention is the capability of the 
process of the present invention to determine the dissociation constant of one 
ligand of the target molecule in the presence of a second molecule already bound to 
the ligand. This is generally not possible with prior art methods which employ 
"wet chemical" analytical methods of determining binding of a ligand to a target 
molecule substrate. 

In this preferred embodiment, the process of determining the dissociation 
constant of a ligand can be performed in the presence of a second bound ligand. In 
accordance with this embodiment, the ^N-labeled target molecule is bound to that 
second ligand before exposing that target to the test compounds. 

The ability of the present method to determine not only the existence of 
binding between one ligand and the target molecule, but also the particular site of 
binding in the presence of a second bound ligand permits the capability to design a 
drug that comprises two or more linked moieties made up of the ligands. 

This method uses the two-dimensional ^N/^H NMR correlation 
spectroscopic screening process as set forth above to identify a first and 
subsequent ligands that bind to the target molecule. A complex of the target 
molecule and two or more ligands is formed and the three-dimensional structure of 
that complex is determined preferably using NMR spectroscopy or X-ray 
crystallography. That three-dimensional structure is used to determine the spatial 
orientation of the ligands relative to each other and to the target molecule. 

Based on the spatial orientation, the ligands are linked together to form the 
drug. The selection of an appropriate linking group is made by maintaining the 
spatial orientation of the ligands to one another and to the target molecule based 
upon principles of bond angle and bond length information well known in the 
organic chemical art. 
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Thus, the molecular design method comprises identifying a first ligand 
moiety to the target molecule using two-dimensional *^N/*H NMR correlation 
spectroscopy; identifying subsequent ligand moieties to the target molecule using 
5 two-dimensional *^N/*H NMR correlation spectroscopy; forming a complex of 
the first and subsequent ligand moieties to the target molecule; determining the 
three dimensional structure of the complex and, thus, the spatial orientation of the 
first and subsequent ligand moieties on the target molecule; and linking the first 
and subsequent ligand moieties to form the drug to maintain the spatial orientation 
10 of the ligand moieties. 

The identification of subsequent ligand moieities can be performed in the 
absence or presence of the first ligand (e.g., the target molecule can be bound to 
the first ligand before being exposed to the test compounds for identification of the 
second ligand). 

15 In a preferred embodiment, the target molecule used in a screening or 

design process is a polypeptide. The polypeptide target is preferably produced in 
recombinant form from a host cell transformed with an expression vector that 
contains a polynucleotide that encodes the polypeptide, by culturing the 
transformed host cell in a medium that contains an assimilable source of such 

20 that the recombinantly produced polypeptide is labeled with ' ^N. 



Brief Description of the Drawings 
In the drawings which form a portion of the specification: 
FIG. 1 shows a *^N/*H correlation spectrum of the DNA binding domain of 
25 uniformly ^N-Iabeled human papillomavirus E2. The spectrum (80 

complex points, 4 scans/fid) was acquired on a 0.5 mM sample of E2 in 20 
mM phosphate (pH 6.5), 10 mM dithiothreitol (DTT) and 10% deuterium 
oxide (D2O). 

FIG. 2 shows *^N/*H correlation spectra of the DNA binding domain of 
30 uniformly ^N-labeled human papillomavirus E2 before (thin multiple 

contours) and after (thick single contours) addition of a final test 
compound. The final concentration of compound was 1.0 mM. All other 
conditions are as stated in FIG. 1. Selected residues that show significant 
changes upon binding are indicated. 
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FIG. 3 shows *^N/*H correlation spectra of the DNA binding domain of 

uniformly ^N-labeled human papillomavirus E2 before (thin multiple 
contours) and after (thick single contours) addition of a second test 
5 compound. The final concentration of compound was 1 .0 mM. All other 

conditions are as stated in FIG. 1 . Selected residues that show significant 
changes upon binding are indicated. 
FIG. 4 shows *^N/*H correlation spectra of the catalytic domain of uniformly 
^N-labeled stromelysin before (thin multiple contours) and after (thick 

10 single contours) addition of a test compound. The final concentration of 

compound was 1 .0 mM. The spectra (80 complex points, 8 scans/fid) 
were acquired on a 0.3 mM sample of SCD in 20 mM TRIS (pH 7.0), 20 
mM CaCl2 and 10% D2O. Selected residues that show significant changes 
upon binding are indicated. 

15 FIG. 5 shows *~*N/*H correlation spectra of the Ras-binding domain of uniformly 
'^NMabeled RAF peptide (residues 55-132) before (thin multiple contours) 
and after (thick single contours) addition of a test compound. The final 
concentration of compound was 1.0 mM. The spectra (80 complex points, 
8 scans/fid) were acquired on a 0.3 mM sample of the RAF fragment in 20 

20 mM phosphate (pH 7.0), 10 mM DTT and 10% D2O. Selected residues 

that show significant changes upon binding are indicated. 
FIG. 6 shows *^N/*H correlation spectra of uniformly ^N-labeled FKBP before 
(thin multiple contours) and after (thick single contours) addition of a test 
compound. The final concentration of compound was 1.0 mM. The 

25 spectra (80 complex points, 4 scans/fid) was acquired on a 0.3 mM sample 

of FKBP in 50 mM phosphate (pH 6.5), 100 mM NaCl and 10% D2O. 
Selected residues that show significant changes upon binding are indicated. 
FIG. 7 shows a first depiction of the NMR-derived structure of the DNA-binding 
domain of E2. The two monomers of the symmetric dimer are oriented in a 

30 top-bottom fashion, and the N- and C-termini of each monomer are 

indicated (N and C for one monomer, N* and C* for the other). Shown in 
ribbons are the residues which exhibit significant chemical shift changes 
(A8( 1 H)>0.04 ppm; A8( 15 N) >0. 1 ppm) upon binding to a first test 
compound. These residues correspond to the DNA-recognition helix of 

35 E2. Selected residues are numbered for aid in visualization. 
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FIG. 8 shows a second depiction of the NMR-derived structure of the DNA- 
binding domain of E2. The two monomers of the symmetric dimer are 
oriented in a top-bottom fashion, and the N- and C-termini of each 
5 monomer are indicated (N and C for one monomer, N* and C* for the 

other). Shown in ribbons are the residues which exhibit significant 
chemical shift changes (A5( 1 H)>0.04 ppm; A6( 15 N) >0.1 ppm) upon 
binding to a second test compound. These residues are located primarily in 
the dimer interface region. Selected residues are numbered for aid in 
10 visualization. 

FIG. 9 shows a depiction of the NMR-derived structure of the catalytic domain of 
stromelysin. The N- and C-termini are indicated. Shown in ribbons are 
the residues which exhibit significant chemical shift changes (A8( 1 H)>0.04 
ppm; A5(^N) >0.1 ppm) upon binding to a test compound. These either 
15 form part of the S V binding site or are spatially proximal to this site. 

Selected residues are numbered for aid in visualization. 

FIG. 10 shows a ribbon plot of a ternary complex of first and second ligands 
bound to the catalytic domain of stromelysin. 



20 Detailed Description of the Invention 

The present invention provides a rapid and efficient screening method for 
identifying ligands that bind to therapeutic target molecules. 

Ligands are identified by testing the binding of molecules to a target 
molecule (e.g., protein, nucleic acid, etc.) by following, with nuclear magnetic 
25 resonance (NMR) spectroscopy, the changes in chemicaL shifts of the target 
molecule upon the addition of the ligand compounds in the database. 

From an analysis of the chemical shift changes of the target molecule as a 
function of ligand concentration, the binding affinities of ligands for biomolecules 
are also determined. 

30 The location of the binding site for each ligand is determined from an 

analysis of the chemical shifts of the biomolecule that change upon the addition of 
the ligand and from nuclear Overhauser effects (NOEs) between the ligand and 
biomolecule. 

Information about the structure/activity relationships between ligands 
35 identified by such a process can then be used to design new drugs that serve as 
ligands to the target molecule. By way of example, where two or more ligands to 
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a given target molecule are identified, a complex of those ligands and the target 
molecule is formed. The spatial orientation of the ligands to each other as well as 
to the target molecule is derived from the three-dimensional structure. That spatial 
orientation defines the distance between the binding sites of the two ligands and the 
5 orientation of each ligand to those sites. 

Using that spatial orientation data, the two or more ligands are then linked 
together to form a new ligand. Linking is accomplished in a manner that maintains 
the spatial orientation of the ligands to one another and to the target molecule. 

There are numerous advantages to the NMR-based discovery process of 

1 o the present invention. First, because a process of the present invention identifies 
ligands by direcdy measuring binding to the target molecule, the problem of false 
positives is significantly reduced. Because the present process identifies specific 
binding sites to the target molecule, the problem of false positives resulting from 
the non-specific binding of compounds to the target molecule at high 

1 5 concentrations is eliminated. 

Second, the problem of false negatives is significantly reduced because the 
present process can identify compounds that specifically bind to the target molecule 
with a wide range of dissociation constants. The dissociation or binding constant 
for compounds can actually be determined with the present process. 

20 Other advantages of the present invention result from the variety and 

detailed data provided about each ligand from the discovery process. 

Because the location of the bound ligand can be determined from an 
analysis of the chemical shifts of the target molecule that change upon the addition 
of the ligand and from nuclear Overhauser effects (NOEs) between the ligand and 

25 biomolecule, the binding of a second ligand can be measured in the presence of a 
first ligand that is already bound to the target. The ability to simultaneously 
identify binding sites of different ligands allows a skilled artisan to 1) define 
negative and positive cooperative binding between ligands and 2) design new 
drugs by linking two or more ligands into a single compound while maintaining a 

30 proper orientation of the ligands to one another and to their binding sites. 

Further, if multiple binding sites exist, the relative affinity of individual 
binding moieties for the different binding sites can be measured from an analysis 
of the chemical shift changes of the target molecule as a function of the added 
concentration of the ligand. By simultaneously screening numerous structural 

35 analogs of a given compound, detailed structure/activity relationships about ligands 
is provided. 
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In its principal aspect, the present invention provides a process of 
screening compounds to identify ligands that bind to a specific target molecule. 
That process comprises the steps of: a) generating a first two-dimensional '^N/*H 
NMR correlation spectrum of a ^N-labeled target molecule; b) exposing the 
5 labeled target molecule to one or more compounds; c) generating a second two- 
dimensional *^N/*H NMR correlation spectrum of the labeled target molecule that 
has been exposed to the compounds of step (b); and d) comparing the first and 
second spectra to determine whether differences in those two spectra exist, which 
differences indicate the presence of one or more ligands that have bound to the 

1 o target molecule. 

Where a process of the present invention screens more than one compound 
in step (b) and where a difference between spectra is observed, additional steps are 
performed to identify which specific compound is binding to the target molecules. 
Those additional steps comprise generating a two-dimensional *"*N/*H NMR 

15 correlation spectrum for each individual compound and comparing each spectrum 
to the first spectrum to determine whether differences in any of those compared 
spectra exist, which differences indicate the presence of a ligand that has bound to 
the target molecule. 

Any ^N-labeled target molecule can be used in a process of the present 

20 invention. Because of the importance of proteins in medicinal chemistry, a 

preferred target molecule is a polypeptide. The target molecule can be labeled with 
using any means well known in the art. In a preferred embodiment, the target 
molecule is prepared in recombinant form using transformed host cells. In an 
especially preferred embodiment, the target molecule is a polypeptide. Any 

25 polypeptide that gives a high resolution NMR spectrum.and can be partially or 
uniformly labeled with *^N can be used. The preparation of uniformly *^N- 
labeled exemplary polypeptide target molecules is set forth hereinafter in the 
Examples. 

A preferred means of preparing adequate quantities of uniformly ^N- 
30 labeled polypeptides is to transform a host cell with an expression vector that 

contains a polynucleotide that encodes that polypeptide and culture the transformed 
cell in a culture medium that contains assimilable sources of ^N. Assimilable 
sources of 15 N are well known in the art. A preferred such source is ^NH4d 
Means for preparing expression vectors that contain polynucleotides 
35 encoding specific polypeptides are well known in the art In a similar manner, 

means for transforming host cells with those vectors and means for culturing those 
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transformed cells so that the polypeptide is expressed are also well known in the 
art. 

The screening process of the present invention begins with the generation 
or acquisition of a two-dimensional ^N/' H correlation spectrum of the labeled 
5 target molecule. Means for generating two-dimensional ^N/^H correlation 
spectra are well known in the art (see, e.g., D. A. Egan et aL, Biochemistry . 
32(8): 1920-1927 (1993); Bax, A., Grzesiek, S., Acc. Chem. Res.. 26(4); 131- 
138 (1993)). 

The NMR spectra that are typically recorded in the screening procedure of 
the present invention are two-dimensional *^N/*H heteronuclear single quantum 
correlation (HSQC) spectra. Because the *^N/*H signals corresponding to the 
backbone amides of the proteins are usually well-resolved, the chemical shift 
changes for the individual amides are readily monitored. 

In generating such spectra, the large water signal is suppressed by spoiling 
gradients. To facilitate the acquisition of NMR data on a large number of 
compounds (e.g., a database of synthetic or naturally occurring small organic 
compounds), a sample changer is employed. Using the sample changer, a total of 
60 samples can be run unattended. Thus, using the typical acquisition parameters 
(4 scans per free induction decay (fid), 100-120 HSQC spectra can be acquired in 
a 24 hour period. 

To facilitate processing of the NMR data, computer programs are used to 
transfer and automatically process the multiple two-dimensional NMR data sets, 
including a routine to automatically phase the two-dimensional NMR data. The 
analysis of the data can be facilitated by formatting the data so that the individual 
HSQC spectra are rapidly viewed and compared to the HSQC spectrum of the 
control sample containing only the vehicle for the added compound (DMSO), but 
no added compound. Detailed descriptions of means of generating such two- 
dimensional *^N/*H correlation spectra are set forth hereinafter in the Examples. 

A representative two-dimensional *^N/*H NMR correlation spectrum of an 
^N-labeled target molecule (polypeptide) is shown in FIG. 1 (the DNA-binding 
domain of the E2 protein). 

Following acquisition of the first spectrum, the labeled target molecule is 
exposed to one or more test compounds. Where more than one test compound is 
to be tested simultaneously, it is preferred to use a database of compounds such as 
a plurality of small molecules. Such molecules are typically dissolved in 
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perdeuterated dimethylsulfoxide . The compounds in the database can be 
purchased from vendors or created according to desired needs. 

Individual compounds can be selected inter alia on the basis of size 
(molecular weight = 100-300) and molecular diversity. Compounds in the 
5 collection can have different shapes (e.g., flat aromatic rings(s), puckered aliphatic 
rings(s), straight and branched chain aliphatics with single, double, or triple 
bonds) and diverse functional groups (e.g., carboxylic acids, esters, ethers, 
amines, aldehydes, ketones, and various heterocyclic rings) for maximizing the 
possibility of discovering compounds that interact with widely diverse binding 
10 sites. 

The NMR screening process of the present invention utilizes ligand 
concentrations ranging from about 0. 1 to about 10.0 mM. At these concentrations, 
compounds which are acidic or basic can significantly change the pH of buffered 
protein solutions. Chemical shifts are sensitive to pH changes as well as direct 

1 5 binding interactions, and "false positive" chemical shift changes, which are not the 
result of ligand binding but of changes in pH, can therefore be observed. It is thus 
necessary to ensure that the pH of the buffered solution does not change upon 
addition of the ligand. One means of controlling pH is set forth below. 

Compounds are stored at 263*K as 1.0 and 0.1 M stock solutions in 

20 dimethylsulfoxide (DMSO). This is necessary because of the limited solubility of 
the ligands in aqueous solution. It is not possible to directly adjust the pH of the 
DMSO solution. In addition, HC1 and NaOH form insoluble salts in DMSO, so 
alternative acids and bases must be used. The following approach has been found 
to result in stable pH. 

The 1 .0 M stock solutions in DMSO are diluted 1:10 in 50 mM phosphate, 
pH 7.0. The pH of that diluted aliquot solution is measured. If the pH of the 
aliquot is unchanged (i.e., remains at 7.0), a working solution is made by diluting 
the DMSO stock solution 1:10 to make aO.l M solution and that solution is stored. 
If the pH of the diluted aliquot is less than 7.0, ethanolamine is added to 
30 the 1 .0 M stock DMSO solution, that stock solution is then diluted 1 : 1 0 with 
phosphate buffer to make another aliquot, and the pH of the aliquot rechecked. 

If the pH of the diluted aliquot is greater than 7.0, acetic acid is added to 
the 1.0 M stock DMSO solution, that stock solution is then diluted 1:10 with 
phosphate buffer to make another aliquot, and the pH of the aliquot rechecked. 



25 



35 
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Ethanolamine and acetic acid are soluble in DMSO, and the proper 
equivalents are added to ensure that upon transfer to aqueous buffer, the pH is 
unchanged. Adjusting the pH is an interactive process, repeated until the desired 
result is obtained. 

5 Note that this procedure is performed on 1:10 dilutions of 1 .0 M stock 

solutions (100 mM ligand) to ensure that no pH changes are observed at the lower 
concentrations used in the experiments (0.1 to 10 mM) or in different/weaker 
buffer systems. 

FoDowing exposure of the 15 N-Iabeled target molecule to one or more test 
10 compounds, a second two-dimensional 15 N/*H NMR correlation spectrum is 
generated. That second spectrum is generated in the same manner as set forth 
above. The first and second spectra are then compared to determine whether there 
are any differences between the two spectra. Differences in the two-dimensional 
N/ H NMR correlation spectra that indicate the presence of a ligand correspond 
15 to 1 5 N-labeled sites in the target molecule. Those differences are determined using 
standard procedures well known in the art 

By way of example, FIGs. 2, 3, 4, 5 and 6 show comparisons of 
correlation spectra before and after exposure of various target molecules to various 
test compounds. A detailed description of how these studies were performed can 
20 be found hereinafter in Examples 2 and 3. 

Particular signals in a two-dimensional ^N/*H correlation spectrum 
correspond to specific nitrogen and proton atoms in the target molecule (e.g., 
particular amides of the amino acid residues in the protein). By way of example, it 
can be seen from FIG. 2 that chemical shifts in a two-dimensional 15 N/*H 
25 correlation of the DNA-binding domain of E2 exposed to a test compound 
occurred at residue positions 15 (115), 21 (Y21), 22 (R22) and 23 (L23). 

It can be seen from FIG. 2 that the binding of the ligand involved the 
isoleucine (He) residue at position 15, the tyrosine (Tyr) residue at position 21, the 
arginine (Arg) residue at position 22 and the leucine (Leu) residue at position 23. 
30 Thus, a process of the present invention can also be used to identify the specific 
binding site between a ligand and target molecule. 

The region of the protein that is responsible for binding to the individual 
compounds is identified from the particular amide signals that change upon the 
addition of the compounds. These signals are assigned to the individual amide 
35 groups of the protein by standard procedures using a variety of well-established 
heteronuclear multi -dimensional NMR experiments. 
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To discover molecules that bind more tightly to the protein, molecules are 
selected for testing based on the structure/activity relationships from the initial 
screen and/or structural information on the initial leads when bound to the protein. 
By way of example, the initial screening may result in the identification of ligands, 

5 all of which contain an aromatic ring. The second round of screening would then 
use other aromatic molecules as the test compounds. 

As set forth hereinafter in Example 2, an initial screening assay for binding 
to the catalytic domain of stromelysin identified two biaryl compounds as ligands. 
The second round of screening thus used a series of biaryl derivatives as the test 

10 compounds. 

The second set of test compounds are initially screened at a concentration 
of 1 mM, and binding constants are measured for those that show affinity. Best 
leads that bind to the protein are then compared to the results obtained in a 
functional assay. Those compounds that are suitable leads are chemically modified 

1 5 to produce analogs with the goal of discovering a new pharmaceutical agent. 

In another aspect, the present invention provides a process for determining 
the dissociation constant between a target molecule and a ligand that binds to that 
target molecule. That process comprises the steps of: a) generating a first two- 
dimensional ^N/*H NMR correlation spectrum of a ^N-labeled target molecule; 

20 b) titrating the labeled target molecule with various concentrations of a ligand; c) 
generating a two-dimensional *^N/*H NMR correlation spectrum at each 
concentration of ligand from step (b); d) comparing each spectrum from step (c) 
both to the first spectrum from step (a) and to aU other spectra from step (c) to 
quantify differences in those spectra as a function of changes in ligand 

25 concentration; and e) calculating the dissociation constant (Kd) between the target 
molecule and the ligand from those differences. 

Because of their importance in medicinal chemistry, a preferred target 
molecule for use in such a process is a polypeptide. In one preferred embodiment, 
a process of determining the dissociation constant of a ligand can be performed in 

30 the presence of a second ligand. In accordance with this embodiment, the *^N- 
labeled target molecule is bound to that second ligand before exposing that target to 
the test compounds. 

Binding or dissociation constants are measured by following the *^N/*H 
chemical shifts of the protein as a function of ligand concentration. A known 

35 concentration (fP]o) of the target moleule is mixed with a known concentration 
([L]o) of a previously identified ligand and the two-dimensional ^N/*H 
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correlation spectrum was acquired. From this spectrum, observed chemical shift 
values (5 0 bs) arc obtained. The process is repeated for varying concentrations of 
the ligand to the point of saturation of the target molecule, when possible, in which 
case the limiting chemical shift value for saturation (5^) is measured. 
5 In those situations where saturation of the target molecule is achieved, the 

dissociation constant for the binding of a particular ligand to the targer molecule is 
calculated using the formula: 

K D 

X 

where [PJo is the total molar concentration of target molecule; [L]o is the total 
1 o molar concentration of ligand; and x is the molar concentration of the bound 
species. The value of x is determined from the equation: 

x _ ^obs - S free 



15 where 5^ is the chemical shift of the free species; 5obs is the observed chemical 
shift; and A is the differenec between the limiting chemical shift value for 
saturation (Ssat) an d the chemical shift value of the target molecule free of ligand 
(Sfree). 

The dissociation constant is then determined by varying its value until a 
20 best fit to the observed data is obtained using standard curve-fitting statistical 

methods. In the case where 5$at IS not directly known, both Kp and 8sai are varied 
and subjected to the same curve-fitting procedure. 

The use of the process of the present invention to determine the 
dissociation or binding affinity of various ligands to various target molecules is set 
25 forth hereinafter in Examples 2 and 3. 

Preferred target molecules, means for generating spectra, and means for 
comparing spectra are the same as set forth above. 

The initial step in the design process is the identification of two or more 
ligands that bind to the specific target molecule. The identification of such ligands 
30 is done using two-dimensional *^N/*H NMR correlation spectroscopy as set forth 
above. 

Once two or more ligands are identified as binding to the target molecule at 
different sites, a complex between the target molecule and ligands is formed. 
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Where there are two ligands, that complex is a ternary complex. Quaternary and 
other complexes are formed where there are three or more ligands. 

Complexes are formed by mixing the target molecule simultaneously or 
sequentially with the various ligands under circumstances that allow those ligands 
5 to bind the target. Means for determining those conditions are well known in the 
art. 

Once that complex is formed, its three-dimensional structure is determined. 

Any means of determining three-dimensional structure can be used. Such methods 

are well known in the art. Exemplary and preferred methods are NMR and X-ray 
10 crystallography. The use of three-dimensional double- and triple resonance NMR 

to determine the three-dimensional structure of two ligands bound to the catalytic 

domain of stromelysin is set forth in detail hereinafter in Example 4. 

An analysis of the three-dimensional structure reveals the spatial orientation 

of the ligands relative to each other as well as to the conformation of the target 
15 molecule. First, the spatial orientation of each ligand to the target molecule allows 

for identification of those portions of the ligand direcdy involved in binding (i.e., 

those portions interacting with the target binding site) and those portions of each 

ligand that project away from the binding site and which portions can be used in 

subsequent linking procedures. 
20 Second, the spatial orientation data is used to map the positions of each 

ligand relative to each other. In other words, discrete distances between the 

spatially oriented ligands can be calculated. 

Third, the spatial orientation data also defines the three-dimensional 

relationships amongst the ligands and the target. Thus, in addition to calculating 
25 the absolute distances between ligands, the angular orien&tions of those ligands 

can also be determined. 

Knowledge of the spatial orientations of the ligands and target is then used 

to select linkers to link two or more ligands together into a single entity that 

contains all of the ligands. The design of the linkers is based on the distances and 
30 angular orientation needed to maintain each of the ligand portions of the single 

entity in proper orientation to the target. 

The three-dimensional conformation of suitable linkers is well known or 

readily ascertainable by one of ordinary skill in the art. While it is theoretically 

possible to link two or more ligands together over any range of distance and three- 
35 dimensional projection, in practice certain limitations of distance and projection are 

preferred. In a preferred embodiment, ligands are separated by a distance of less 
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than about 15 Angstroms (A), more preferably less than about 10 A and, even 
more preferably less than about 5 A. 

Once a suitable linker group is identified, the ligands are linked with that 
linker. Means for linking ligands are well known in the art and depend upon the 
5 chemical structure of the ligand and the linking group itself. Ligands are linked to 
one another using those portions of the ligand not directly involved in binding to 
the target molecule. 

A detailed description of the design of a drug that inhibits the proteolytic 
activity of stromelysin, which drug was designed using a process of the present 
to invention is set forth hereinafter in Example 4. 

The following Examples illustrate preferred embodiments of the present 
invention and are not limiting of the specification and claims in any way. 

Example 1 

15 Preparation Of Uniformly ^N-Labeled Target Molecules 
A. Stromelvsin 

Human stromelysin is a 447-amino acid protein believed to be involved in 
proteolytic degradation of cartilage. Cartilage proteolysis is believed to result in 
degradative loss of joint cartilage and the resulting impairment of joint function 

20 observed in both osteoarthritis and rheumatoid arthritis. The protein possesses a 
series of domains including N-terminal latent and propetide domains, a C-terminal 
domain homologous with homopexin, and an internal catalytic domain. 

Studies have shown that removal of the N-terminal prosequence of 
approximately eighty amino acids occurs to converuhe proenzyme to the 45 kDa 

25 mature enzyme. Furthermore, studies have shown that the* C-terminal homopexin 
homologous domain is not required for proper folding of the catalytic domain or 
for interaction with an inhibitor. (See, e.g., A. 1. Marcy, Biochemistry . 30: 6476- 
6483 (1991). Thus, the 81-256 amino acid residue internal segment of 
stromelysin was selected as the protein fragment for use in identifying compounds 

30 which bind to and have the potential as acting as inhibitors of stromelysin. 

To employ the method of the present invention, it was necessary to prepare 
the 81-256 fragment (SEQ ID NO:l) of stromelysin in which the peptide backbone 
was isotopically enriched with and ^N. This was done by inserting a plasmid 
which coded for the production of the protein fragment into an E. coli strain and 

35 growing the genetically-modified bacterial strain in a limiting culture medium 
enriched with 15 NH4C1 and 13 C-glucose. 
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The isotopically enriched protein fragment was isolated from the culture 
medium, purified, and subsequently used as the basis for evaluating the binding of 
test compounds. The procedures for these processes are described below. 

Human skin fibroblasts (ATCC No, CRL 1507) were gTown and induced 
5 using the procedure described by Clark et al., Archiv. Biochem. and Biophvs .. 
241: 36-45 (1985). Total RNA was isolated from 1 g of cells using a Promega 
RN Agents® Total RNA Isolation System Kit (Cat.# Z51 10, Promega Corp., 2800 
Woods Hollow Road, Madison, WI 5371 1-5399) following the manufacturers 
instructions. A 1 (ig portion of the RNA was heat-denatured at 80*C for five 

10 minutes and then subjected to reverse transcriptase PCR using a GeneAmp® RNA 
PCR kit (Cat# N808-O017, Applied Biosystems/Perkin-Elmer, 761 Main Avenue, 
Norwalk, CT 06859-0156) following the manufacturer's instructions. 

Nested PCR was performed using first primers (A) GAAATGAAGAGTC 
TTCAA (SEQ ID NO:3) and (B) GCGTCCCAGGTTCTGGAG (SEQ ID NO:4) 

1 5 and thirty-five cycles of 94*C, two minutes; 45°C, two minutes; and 72°C three 
minutes. This was followed by reamplification with internal primers (C) 
ATACCATGGCCTATCCAT TGGATGGAGC (SEQ ID NO:5) and (D) 
ATAGGATCCTTAGGTCTCAGGGGA GTCAGG (SEQ ID NO:6) using thirty 
cycles under the same conditions described immediately above to generate a DNA 

20 coding for amino acid residues 1-256 of human stromelysin. 

The PCR fragment was then cloned into PCR cloning vector pT7Blue(R) 
(Novagen, Inc., 597 Science Drive, Madison, WI 5371 1) according to the 
manufacturer's instructions. The resulting plasmid was cut with Ncol and BamHI 
and the stromelysin fragment was subcloned into the Novagen expression vector 

25 pET3d (Novagen, Inc., 597 Science Drive, Madison, WI 5371 1), again using the 
manufacturer's instructions. 

A mature stromelysin expression construct coding for amino acid residues 
81-256 plus an initiating methionine was generated from the 1-256 expression 
construct by PCR amplification. The resulting PCR fragment was first cloned into 

30 the Novagen pT7Blue(R) vector and then subcloned into the Novagen pET3d 
vector, using the manufacturer's instructions in the manner described above, to 
produce plasmid (pETST-83-256). This final plasmid is identical to that described 
by Qi-Zhuang et al., Biochemistry. 31: 11231-1 1235 (1992) with the exception 
that the present codes for a peptide sequence beginning two amino acids earlier, at 

35 position 81 in the sequence of human stromelysin. 
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Plasmid pETST-83-256 was transformed into £. coli strain 
BL21(DE3)/pLysS (Novagen, Inc., 597 Science Drive, Madison, WI 5371 1) in 
accordance with the manufacturer's instructions to generate an expression strain, 
BL2 1 (DE3)/pLysS/pETST-255- 1 . 

A preculture medium was prepared by dissolving 1.698 g of 
Na2HP4*7H20, 0.45 g of KH2PO4, 0.075 g NaCI, 0.150 g ^NH^l, 0.300 
13C-glucose, 300 nL of 1 M aqueous MgS04 solution and 1 5 |lL of aqueous 
CaCl2 solution in 150 mL of deionized water. 

The resulting solution of preculture medium was sterilized and transferred 
to a sterile 500 mL baffle flask. Immediately prior to inoculation of the preculture 
medium with the bacterial strain, 150 |iL of a solution containing 34 mg/mL of 
chloramphenicol in 100% ethanol and 1.5 mL of a solution containing 20 mg/mL 
of ampicillin were added to the flask contents. 

The flask contents were then inoculated with 1 mL of glycerol stock of 
genetically-modified E. Coli y strain BL21(DE3)/pLysS/pETST-255-l. The flask 
contents were shaken (225 rpm) at 37°C until an optical density of 0.65 was 
observed. 

A fermentation nutrient medium was prepared by dissolving 1 13.28 g of 
Na2HP4»7H20, 30 g of KH2PO4, 5 g NaCI and 10 mL of 1 % DF-60 antifoam 
agent in 9604 mL of deionized water. This solution was placed in a New 
Brunswick Scientific Micros Fermenter (Edison, NJ) and sterilized at 121 °C for 40 
minutes. 

Immediately prior to inoculation of the fermentation medium, the following 
pre-sterilized components were added to the fermentation vessel contents: 100 mL 

of a 10% aqueous solution of ^NfLjCI, 100 mL of a 10% aqueous solution of 
13 

C-glucose, 20 mL of an aqueous 1M solution of MgS04, 1 mL of an aqueous 
1M CaCl2 solution, 5 mL of an aqueous solution of thiamin hydrochloride (10 
mg/mL), 10 mL of a solution containing 34 mg/mL of chloramphenicol in 100% 
ethanol and L9 g of ampicillin dissolved in the chloramphenicol solution. The pH 
of the resulting solution was adjusted to pH 7.00 by the addition of an aqueous 
solution of 4N H2SO4. 

The preculture of £. Coli , strain BL21 (DE3)/pLysS/pETST-255-l , from 
the shake-flask scale procedure described above was added to the fermentor 
contents and cell growth was allowed to proceed until an optical density of 0.48 
was achieved. During this process, the fermenter contents were automatically 
maintained at pH 7.0 by the addition of 4N H2SO4 or 
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4N KOH as needed. The dissolved oxygen content of the fermenter contents was 
maintained above 55% air saturation through a cascaded loop which increased 
agitation speed when the dissolved oxygen content dropped below 55%. Air was 
fed to the fermenter contents at 7 standard liters per minute (SLPM) and the culture 
5 temperature was maintained at 37 *C throughout the process. 

The cells were harvested by centrifugation at 17,000 x g for 10 minutes at 
4°C and the resulting cell pellets were collected and stored at -85*C. The wet cell 
yield was 3.5 g/L. Analysis of the soluble and insoluble fractions of cell lysates 
by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) 
10 revealed that approximately 50% of the '^N-stromelysin was found in the soluble 
phase. 

The isotopicaUy-labeled stromelysin fragment prepared as described above 
was purified employing a modification of the technique described by Ye et al » 
Biochemistry, 31: 11231-11235 (1992). 

15 The harvested cells were suspended in 20 mM Tris-HCl buffer (pH 8.0) 

sodium azide solution containing 1 mM MgCl2, 0.5 mM ZnCl2> 25 units/mL of 
Benzonase® enzyme, and an inhibitor mixture made up of 4-(2-aminoethyl)- 
benzenesulfonyl fluoride ("AEBSF*), Leupeptin®, Aprotinin®, and Pepstatin® 
(all at concentrations of 1 ng/mL. AEBSF, Leupeptin®, Aprotinin®, and 

20 Pepstatin® are available from American International Chemical, 17 Strathmore 
Road, Natick, MA 01760.) 

The resulting mixture was gently stirred for one hour and then cooled to 
4°C. The cells were then sonically disrupted using a 50% duty cycle. The 
resulting lysate was centrifuged at 14,000 rpm for 30 minutes and the pellet of 

25 insoluble fraction frozen at -80°C for subsequent processing (see below). 

Solid ammonium sulfate was added to the supernatant to the point of 20% 
of saturation and the resulting solution loaded onto a 700 mL phenyl sepharose fast 
flow ("Q-Sepharose FF') column (Pharmacia Biotech., 800 Centennial Ave., P. 
O. Box 1327, Piscataway, NJ 08855). Prior to loading, the sepharose column 

30 was equilibrated with 50 mM Tris-HCl buffer (pH 7.6 at 4°C), 5 mM CaCl2, and 
1 M (NH4)2S04. The loaded column was eluted with a linear gradient of 
decreasing concentrations of aqueous (NH4)2S04 (from 1 down to 0 M) and 
increasing concentrations of aqueous CaCl2 (from 5 to 20 mM) in Tris-HCl buffer 
at pH 7.6. 
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The active fractions of eluate were collected and concentrated in an Amicon 
stirred cell (Amicon, Inc., 72 Cherry Hill Drive, Beverly, MA 01915). The 
concentrated sample was dialyzed overnight in the starting buffer used with the Q- 
Sepharose FF column, 50 mM Tris-HCl (pH 8.2 at 4°C) with 10 mM CaCl2. 
5 The dialyzed sample was then loaded on the Q-Sepharose FF column and 

eluted with a linear gradient comprising the starting buffer and 200 mM NaCl. The 
purified soluble fraction of the isotopically-labeled stromelysin fragment was 
concentrated and stored at 4°C 

The pellet was solubilized in 8M guanidine-HCl. The solution was 

10 centrifuged for 20 minutes at 20,000 rpm and the supernatant was added dropwise 
to a folding buffer comprising 50 mM Tris-HCl (pH 7.6), 10 mM CaCl2 0.5 mM 
ZnCl2 and the inhibitor cocktail of AEBSF, Leupeptin®, Aprotinin®, and 
Pepstatin® (all at concentrations of 1 ng/mL). The volume of folding buffer was 
ten times that of the supernatant. The mixture of supernatant and folding buffer 

15 was centrifuged at 20,000 rpm for 30 minutes. 

The supernatant from this centrifugation was stored at 4*C and the pellet 
was subjected twice to the steps described above of solubilization in guanidine- 
HCl, refolding in buffer, and centrifugation. The final supernatants from each of 
the three centrifugations were combined and solid ammonium sulfate was added to 

20 the point of 20% saturation. The resulting solution thus derived from the insoluble 
fraction was subjected to purification on phenyl Sepharose and Q-Sepharose as 
described above for the soluble fraction. 

The purified soluble and insoluble fractions were combined to produce 
about 1.8 mg of purified isotopically-labeled stromelysin 81-256 fragment per 

25 gram of original cell paste. 

B. Human papillomavirus (HPV) E2 Inhibitors 

The papillomaviruses are a family of small DNA viruses that cause genital 
warts and cervical carcinomas. The E2 protein of HPV regulates viral transcription 

30 and is required for viral replication. Thus, molecules that block the binding of E2 
to DNA may be useful therapeutic agents against HPV. The protein rather than the 
DNA was chosen as a target, because it is expected that agents with greater 
selectivity would be found that bind to the protein rather than the DNA. 

The DNA-binding domain of human papillomavirus E2 was cloned from 

35 the full length DNA that codes for E2 using PCR and overexpressed in bacteria 
using the T7 expression system. Uniformly ^N-labeled protein was isolated 
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from bacteria grown on a minimal medium containing ^N-labeled protein was 
isolated from bacteria grown on a minimal medium containing ^N-labeled 
ammonium chloride. The protein was purified from the bacterial cell lysate using 
an S-sepharose FastFlow column pre-equilibrated with buffer (50 mM Tris, 100 

5 mM NaCl, 1 mM EDTA, pH = 8.3). 

The protein was eluted with a linear gradient of 100-500 mM NaCl in 
buffer, pooled, and applied to a Mono-S column at a pH = 7.0. The protein was 
eluted with a salt gradient (100-500 mM), concentrated to 03 mM, and exchanged 
into a TRIS (50 mM, pH = 7.0 buffered H2O/D2O (9/1) solution containing 

10 sodium azide (0.5%). 
• 

C. RAF 

Uniformly ^N-labeled Ras-binding domain of the RAF protein was 
prepared as described in Emerson et al., Biochemistry . 34 (21): 691 1-6918 
15 (1995). 



D. FKBP 

Uniformly ^N-labeled recombinant human FK binding protein (FKBP) 
was prepared as described in Logan et al., J. Mol. Biol. . 236: 637-648 (1994). 

20 

Example 2 

Screening Compounds Using Two-Dimensional ^nAh NMR Correlation 
Spectral Analysis 

The catalytic domain of stromelysin was prepared in accordance with the 
25 procedures of Example 1 . The protein solutions used iir the screening assay 
contained the uniformly ^N-labeled catalytic domain of stromelysin (0.3 mM), 
acetohydroxamic acid (500 mM), CaCl2 (20 mM), and sodium azide (0.5%) in a 
H2O/D2O (9/1) TRIS buffered solution (50 mM, pH=7.0). 

Two-dimensional *^N/*H NMR spectra were generated at 29° C on a 
30 Bruker AMX500 NMR spectrometer equipped with a triple resonance probe and 
Bruker sample changer. The 15 N/*H HSQC spectra were acquired as 80 x 1024 
complex points using sweep widths of 2000 Hz ( 15 N, t 1 ) and 8333 Hz ( ! H, t2). 
A delay of 1 second between scans and 8 scans per free induction decay(fid) were 
employed in the data collection. All NMR spectra were processed and analyzed on 
35 Silicon Graphics computers using in-house- written software. 
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A first two-dimensional N/ H NMR correlation spectrum was acquired 
15 

for the N-labeled stromelysin target molecule as described above. The 
stromelysin target was then exposed to a database of test compounds. Stock 
solutions of the compounds were made at 100 mM and 1 M. In addition, a 
combination library was prepared that contained 8-10 compounds per sample at a 
concentration of 100 mM for each compound. 

The pH of the 1 M stock solution was adjusted with acetic acid and 
ethanolamine so that no pH change was observed upon a 1/10 dilution with a 100 
mM phosphate buffered solution (pH = 7.0). It is important to adjust the pH, 
because small changes in pH can alter the chemical shifts of the biomolecules and 
complicate the interpretation of the NMR data. 

The compounds in the database were selected on the basis of size 
(molecular weight = 100-300) and molecular diversity. The molecules in the 
collection had different shapes (e.g., flat aromatic rings(s), puckered aliphatic 
rings(s), straight and branched chain aliphatics with single, double, or triple 
bonds) and diverse functional groups (e.g., carboxylic acids, esters, ethers, 
amines, aldehydes, ketones, and various heterocyclic rings) for maximizing the 
possibility of discovering compound that interact with widely diverse binding 
sites. 

The NMR samples were prepared by adding 4 |ii of the DMSO stock 
solution of the compound mixtures that contained each compound at a 
concentration of 100 mM to 0.4 ml H2O/D2O (9/1) buffered solution of the 
uniformly ^N-labeled protein. The final concentration of each of the compounds 
in the NMR sample was about 1 mM. 

In an initial screen, two compounds were found that bind to the catalytic 
domain of stromelysin. Both of these compounds contain a biaryl moiety. Based 
on these initial hits, structurally similar compounds were tested against 
stromelysin. The structure of those biaryl compounds is represented by the 
structure I, below. (See Table 1 for definitions of R1-R3 and A1-A3). 



In the second round of screening, binding was assayed both in the absence 
and in the presence of saturating amounts of acetohydroxamic acid (500 mM). 
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Many of the biaryl compounds were found to bind the catalytic domain of 
stromelysin. FIG. 4 shows a representative two-dimensional *^N/*H NMR 
correlation spectrum before and after exposure of stromelysin to a biaryl test 
5 compound. It can be seen from FIG. 4 that the compound caused chemical shifts 
of 15 N-sites such as those designated W124, T187, A199 and G204. 

These sites correspond to a tryptophan (Trp) residue at position 124, a 
threonine (Thr) at position 1 87, an alanine (Ala) at position 199, and a glycine 
(Gly) at position 204 of SEQ ID NO. 1. FIG. 9 shows the correlation between the 
10 NMR binding data and a view of the NMR-derived three-dimensional structure of 
the catalytic domain of stromelysin. The ability to locate the specific binding site 
of a particular ligand is an advantage of the present invention. 

Some compounds only bound to stromelysin in the presence of 
hydroxamic acid. Thus, the binding affinity of some compounds was enhanced in 
15 the presence of the hydroxamic acid (i. e. cooperative). These results exemplify 
another important capability of the present screening assay: the ability to identify 
compounds that bind to the protein in the presence of other molecules. 

Various biaryl compounds of structure I were tested for binding to 
stromelysin at differing concentrations. The *^N/*H spectra generated at each 
20 concentration were evaluated to quantify differences in the spectra as a function of 
compound concentration, A binding or dissociation constant (Ko)was calculated, 
using standard procedures well known in the art, from those differences. The 
results of this study are shown in Table 1 . The values for R1-R3 and A1-A3 in 
Table 1 refer to the corresponding positions in the structure I, above. 

25 

Table 1 



Compound 
No. 


Ri 


R2 


R3 


Ai 


A 2 


A3 


Kp(mM) 


1 


H 


OH 


H 


C 


C 


c 


1.1 


2 


CH2OH 


H 


H 


C 


C 


c 


3.2 


3 


Br 


H 


OH 


C 


c 


c 


1.3 


4 


H 


H 


H 


N 


N 


c 


1.6 


5 


CHO 


H 


H 


C 


c 


c 


1.7 


6 


OCH3 


NH2 


H 


C 


c 


c 


0.4 
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7 


H 


H 


H 


N 


c 


c 


0.2 


8 


OCOCH3 


H 


H 


C 


c 


c 


0.3 


9 


OH 


H 


OH 


c 


c 


c 


0.16 


10 


H 


H 


H 


N 


c 


N 


0.4 


11 


OH 


H 


H 


C 


c 


c 


0.3 


12 


OH 


H 


CN 


c 


c 


c 


0.02 



The data in Table 1 show the utility of a process of the present invention in 
determining dissociation or binding constants between a ligand and a target 
molecule. 

5 Another advantage of an NMR screening assay of the present invention is 

the ability to correlate observed chemical shifts from the two-dimensional *^N/*H 
NMR correlation spectra with other spectra or projections of target molecule 
configuration. The results of a representative such correlation are shown in FIG. 
9, which depicts regions within the polypeptide at which binding with the substrate 

10 molecule is most likely occurring. In this Figure, the apparent binding regions in 
stromelysin are shown for Compound 1 (from Table I). 

Compounds from the database were screened in a similar manner for 
binding to the DN A- binding domain of the E2 protein. Those compounds had the 
structure H below, where R1-R4 and A are defined in Table 2. 

15 




NMR experiments were performed at 29*C on a Bruker AMX500 NMR 
spectrometer equipped with a triple resonance probe and Bruker sample changer. 
20 The ***N-/*H HSQC spectra were acquired as 80 x 1024 complex points using 
sweep widths of 2000 Hz ( 15 N,q ) and 8333 Hz (* H, t2). A delay of 1 second 
between scans and 4 scans per free induction decay were employed in the data 
collection* All NMR spectra were processed and analyzed on Silicon Graphics 
computers. 

25 FIGs. 2 and 3 show representative two-dimensional *^N/*H NMR 

correlation spectra before and after exposure of the DNA-binding domain of E2 to 
a first and second test compound, respectively. 
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It can be seen from FIG. 2 that the first test compound caused chemical 
shifts of 15 N-sites such as those designated 115, Y21, R22 and L23. Those sites 
correspond to an isoleucine (De) residue at position 15, a tyrosine residue (Tyr) at 
position 21, an arginine (Arg residue at position 22 and a leucine (Leu) residue at 
position 23 of SEQ ID NO. 6. 

Jt can be seen from FIG. 3 that the second test compound caused chemical 
shifts in the particular 15 N-sites designated 16, Gl 1, H38, and T52. Those sites 
correspond to an isoleucine (He) residue at position 6, a glycine (Gly) residue at 
position 1 1, a histidine (His) residue at position 38 and a threonine (Thr) at 
position 52 of SEQ ID NO. 6. 

FIGs. 7 and 8 show the correlation between those NMR binding data and a 
view of the NMR-derived three-dimensional structure of the DNA-binding domain 
of E2. 

Several structurally similar compounds caused chemical shift changes of 
the protein signals when screened at a concentration of 1 mM. Two distinct sets of 
amide resonances were found to change upon the addition of the compounds: one 
set of signals corresponding to amides located in the B-barrel formed between the 
two monomers and a second set corresponding to amides located near the DNA- 
binding site. 

For example* compounds containing two phenyl rings with a carboxylic 
acid attached to the carbon linking the two rings only caused chemical shift 
changes to the amides in the DNA-binding site. In contrast, benzophenones and 
phenoxyphenyl-containing compounds only bound to the B-barrel. Other 
compounds caused chemical shift changes of both sets of signals but shifted the 
signals in each set by different amounts, suggesting the presence of two distinct 
binding sites. 

By monitoring the chemical shift changes as a function of ligand 
concentration, binding constants for the two binding sites were also measured. 
The results of those studies are summarized below in Table 2. 
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Table 2 



Comp. 
No. 


A 


Ri 


R 2 


*3 


R4 


DNA 
Ko(mM) 


(^barrel 
KD(mM) 


Filter 
binding 

assay 


13 


CO 


H 


H 


H 


OH 


>50 


0.6 




14 


o 


H 


H 


H 


CH 2 OH 


>50 


2.0 




15 


.a 


H 


H 


COO 


H 


2.0 


>50 


+ 


16 


_a 


Ci 


Ci 


coo 


H 


0.1 


>50 




17 


.a 


H 


H 


CH2COO 


H 


4.2 


4.9 


+ 


18 


_a 


H 


H 


CH=CHCOO 


H 


1.2 


6.2 


+ 


19 


0 


H 


H 


CH 2 CH 2 CH(CH 3 ) 
-CH2COO 


H 


0.5 


0.2 




20 


0 


H 


H 


COCH2CH2COO 


H 


2.7 


4.8 





a a dash (-) for A indicates no atom (i.e. byphenyl linkage) 



5 Uniformly N-labeled Ras-binding domain of the RAF protein was 

prepared as described in Example 1 and screened using two-dimensional *^N/*H 
NMR correlation spectral analysis in accordance with the NMR procedures 
described above. The results of a representative study are shown in FIG. 5, which 
depicts two-dimensional *^N/*H NMR correlation spectra both before and after 

1 o exposure to a test compound. 

Uniformly * 5 N-labeled FKBP was prepared as described in Example 1 
and screened using two-dimensional *^N/*H NMR correlation spectral analysis in 
accordance with the NMR procedures described above. The results of a 
representative study are shown in FIG. 6, which depicts two-dimensional *^N/*H 

15 NMR correlation spectra both before and after exposure to a test compound. 

Example 3 

Comparison of NMR. Enzymatic. Filter Binding and Gel Shift Screening Assays 
Studies were performed to compare binding constants of ligands to various 
20 biomolecules, determined by the NMR method of the present invention, to similar 
results obtained from prior art methods. 

In a first study, binding constants were determined, both by the NMR 
method of the present invention, and by a prior art enzymatic assay. The target 
molecule was the catalytic domain of stromelysin prepared in accordance with the 



SUBSTITUTE SHEET (RULE 265 



WO 97/18471 



PCT/US96/18270 



27 

procedures of Example 1 . The NMR binding constants, Kd, were derived using 
two-dimensional *^N/*H NMR correlation spectroscopy as described in Example 
2. The Kd values so obtained were compared to an inhibition constant Ki as 

determined in an enzymatic assay. 
5 The enzymatic assay measured the rate of cleavage of a fluorogenic 

substrate by following the fluorescence increase upon peptide cleavage which 
causes a separation between the fluorophore and quencher. Enzymatic activity was 
measured using a matrix of different concentrations of acetohydroxamic acid and 
biaryl compounds. The assay is a modification of the method described by H. 
10 Weingarten, et aL in Anal. Biocheny 147: 437-440 (1985) employing the 

fluorogenic substrate properties described by E. Matayoshi, et aL in Science : 247: 
954-958 (1990). 

Eight acetohydroxamic acid concentrations were used ranging from 0.0 to 
1.0 M y and six compound concentrations weTe used, resulting in a total of 48 
1 5 points. Individual compound concentration varied due to solubility and potency. 

All NMR measurements were performed in the presence of 500 mM 
acetohydroxamic acid, except for the titration of acetohydroxamic acid itself. 
Dissociation constants were obtained from the dependence of the observed 
chemical shift changes upon added ligand. Inhibition constants were then obtained 
20 from the inhibition data using standard procedures. 

The results of these studies are summarized below m Table 3, which 
shows the comparison of NMR-derived dissociation constants (Kd) with 
inhibition constants measured in the enzyme assay (Kj) using a fluorogenic 
substrate. 

25 

Table 3 



Compound 
No. 


NMRKD 

(mM) 


Assay K] 1 
(mM) 


4 


1.6 


7.4 


7 


0.17 


0.32 


9 


0.16 


0.70 J 


10 


0.40 


1.8 | 


12 


0.02 


0.11 | 


Acetohydroxamic acid 


17.0 


21.1 I 
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The data in Table 3 show that a NMR process of the present invention 
provides a rapid, efficient and accurate way of determining dissociation or binding 
constants of ligands to target biomolecules. Comparison of the binding constants 
determined by the two methods result in the same ranking of potencies of the 
compounds tested. That is, while the values for a given substrate as determined by 
the two methods are not equal, they are proportional to one another. 

In a second study, the results for binding of the DNA-binding domain of 
E2 to its target DNA were obtained by prior art methods and compared with results 
obtained by the method of the present invention. The target was the DNA-binding 
domain of E2, prepared in accordance with the procedures of Example 1 . NMR 
screening assays and NMR processes for determining ligand dissociation constants 
were performed as set forth above in Example 2. 

The binding constant from the NMR process was compared to the results 

of a physical, filter binding assay that measured binding of DNA to the target. The 

high-throughput filter binding assay was performed using E2, prepared according 

33 

to Example 2 above. The P-labeled DNA construct comprised a 10,329 base 
pair plasmid formed by inserting the HPV-1 1 genome, containing three high 
affinity and one low affinity E2 binding sites, into the PSP-65 plasmid (Promega, 
Madison, WI), 

The binding affinities at the different sites as determined by NMR were 
compared for a subset of the compounds to the inhibition of E2 binding to DNA as 
measured in the filter binding assay. As shown in Table 2 above, the activities 
determined in the filter binding assay correlated closely with the binding affinities 
calculated from the amides of the DNA-binding site-but not to the affinities 
measured for the B-barrel site. This is consistent with the relative locations of each 
site. 

In an alternative study, a comparison of the NMR-determined binding 
results was made with similar results obtained by a prior art gel-shift assay using 
techniques well known in the art The gel-shift assay was performed using a GST 
fusion protein which contained full length E2 and a 33 P-labeled 62 base pair DNA 
fragment containing two E2 binding sites. 

The method identified numerous compounds which gave positive results in 
the gel-shift assay. Some of these positive results, however, were believed to be 
due to binding to the DNA, since in these cases, no binding to the E2 protein was 
observed using the NMR method of this invention. These compounds were 
shown to indeed bind to DNA rather than to E2, as evidenced by changes in the 
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chemical shifts of the DN A rather than the protein upon the addition of the 
compounds. These data show that yet another advantage of the present invention 
is the ability to minimize the occurrence of false positives. 

Example 4 

Design of a potent non-peptide inhibitor of stromelvsin 

Studies were performed to design new ligands that bound to the catalytic 
domain of stromelysin. Because stromelysin undergoes autolysis, an inhibitor 
was sought to block the degradation of stromelysin. That inhibitor would facilitate 
the screening of other potential ligands that bind to other sites on the enzyme. 

The criteria used in selecting compounds in the screening for other binding 
sites was based primarily on the size of the ligand. The smallest ligand was sought 
that had enough solubility to saturate (>98% occupancy of enzyme) and inhibit the 
enzyme. 

The cloning, expression, and purification of the catalytic domain of 
stromelysin was accomplished using the procedures set forth in Example L An 
initial step in the design of the new ligand was the identification of a first ligand 
that bound to the stromelysin target. Such identification was carried out in 
accordance with a two-dimensional *^N/*H NMR correlation screening process as 
disclosed above. 

A variety of hydroxamic acids of the general formula R-(CO)NHOH were 
screened for binding to stromelysin using the procedures set forth in Example 2. 
Of the compounds tested, acetohydroxamic acid [CH3(CO)NHOH] best satisfied 
the selection criteria: it had a binding affinity for stromelysin of 17 mM and had 
good water solubility. At a concentration of 500 mM, acetohydroxamic acid 
inhibited the degradation of the enzyme, allowing the screening of other potential 
ligands. 

The second step in the design process was the identification of a second 
ligand that bound to the target stromelysin at a site different from the binding site 
of acetohydroxamic acid. This was accomplished by screening compounds for 
their ability to bind stromelysin in the presence of saturating amounts of 
acetohydroxamic acid. Details of procedures and results of this second 
identification step are set forth above in Example 2. 

The compound identified as a second ligand from these studies and used in 
subsequent design steps was the compound designated as Compound #4 in Table 
1 (See Example 2). 
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The next step in the design process was to construct a ternary complex of 
the target stromelysin, the first ligand and the second ligand. This was 
accomplished by exposing the stromelysin target to the two Hgands under 
conditions that resulted in complex formation. The three-dimensional structure of 
5 the ternary complex was then determined using NMR spectroscopy as described 
below. 

The ] H, 13 C, and 15 N backbone resonances of stromelysin in the ternary 
complex were assigned from an analysis of several 3D double- and triple- 
resonance NMR spectra (A. Bax. et aL Acc. Chem. Res. , 26: 131-138 (1993)). 
o The C a resonances of adjacent spin systems were identified from an analysis of 
three-dimensional (3D) HNCA (L. Kay et aL J. Mapn. Reson. . 89: 496-514 
(1990)) and HN(CO)CA (A. Bax, et aL J. Bio. NMR . 1: 99 (1991)) spectra 
recorded with identical spectral widths of 1773 Hz (35.0 ppm), 3788 Hz (30.1 
ppm), and 8333 Hz (16.67 ppm) in the Fi( 15 N), F2( 13 C) and F3( ] H) 
dimensions, respectively. 

The data matrix was 38(t]) x 48(t2) x 1024(t3) complex points for the 
HNCA spectrum, and 32(ti) x 40(t2) x 1024(t3) complex points for the 
HN(CO)CA spectrum. Both spectra were acquired with 16 scans per increment. A 
3D CBCA(CO)NH spectrum (S. Grzesiek, et aL J. Am. Chem, Soc 114: 6261- 
6293 (1992)) was collected with 32(t], 15 N) x 48(t2> 13 C) x 1024(t3, *H) 
complex points and 32 scans per increment. Spectral widths were 1773 Hz (35.0 
ppm), 7575.8 Hz (60.2 ppm), and 8333 Hz (16.67 ppm) in the 15 N, 13 C and *H 
dimensions, respectively. 

For aU three spectra, the *H carrier frequency was set on the water 

15 * 13 

resonance and the N carrier frequency was at 1 1 9. 1 ppm. The C carrier 

frequency was set to 55.0 ppm in HNCA and HN(CO)CA experiments, and 46.0 

ppm in the CBCA(CO)NH experiment. 

The backbone assignments were confirmed from an analysis of the 

crosspeaks observed in an ^N-separated 3D NOESY-HSQC spectrum and a 3D 

HNHA-J spectrum. The 15 N- separated 3D NOESY-HSQC spectrum (S. Fesik, 

et aL J. Magn. Reson. , 87: 588-593 (1988)); D. Marion, et aL I. Am. Chem. 

Soc. . Ill: 1515-1517 (1989)) was collected with a mixing time of 80 ms. A total 

of 68(tl, 15 N) x 96(t2, *H) x 1024(t3, *H) complex points with 16 scans per 

increment were collected, and the spectral widths were 1773 Hz (35.0 ppm) for the 

15 N dimension, 6666.6 Hz (t2, ] H, 13.3 ppm), and 8333 Hz (16.7 ppm) for the 

' H dimension. 
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The 3D HNHA-J spectrum (G. Vuister, et al y J. Am. Chem. Soc. . 115: 

7772-7777 (1993)), which was also used to obtain 3 JHNHa coupling constants, 

was acquired with 35(ti, 15 N) x 64(t2, 'h) x 1024(t3, *H) complex points and 

32 scans per increment. Spectral widths and carrier frequencies were identical to 

5 those of the I5 N-separated NOESY-HSQC spectrum. Several of the signals 

were assigned using the HNHB experiment. The sweep widths were the same as 

in the 15 N- separated NOESY-HSQC spectrum that was acquired with 32(q, 15 N) 

x 96(t2, 1 H) x 1024 (t3, J H) complex points. 
1 13 

The H and C chemical shifts were assigned for nearly all sidechain 

1 0 resonances. A 3D HCCH-TOCS Y spectrum (L. Kay, et a/., J. Magn. Reson. . 

101b: 333-337 (1993)) was acquired with a mixing time of 13 ms using the 

DIPSI-2 sequence (S. Rucker, et aL y Mol. Phvs. . 68: 509 (1989)) for 13 C 

isotropic mixing. A total of 96 (ti, 13 C) x 96(t2, *H) x 1024(t3, J H) complex 

data points were collected with 16 scans per increment using a spectral width of 

15 10638 Hz (70.8 ppm, wi), 4000 Hz (6.67 ppm, W2), and 4844 (8.07 ppm, W3). 

1 3 

Carrier positions were 40 ppm, 2.5 ppm, and at the water frequency for the C, 
indirectly detected *H, and observed *H dimensions, respectively. 

Another 3D HCCH-TOCSY study was performed with the C carrier at 
122.5 ppm to assign the aromatic residues. The spectra were collected with 
20 36(ti, 3 C) x 48(t2,*H) x 1024 (t3,*H) complex points with spectral widths of 
5263 Hz (35.0 ppm, wi), 3180 Hz (5.30 ppm, W2), and 10,000 (16.7 ppm, W3). 

Carrier positions were 122.5 ppm, 7.5 ppm, and at the water frequency for the 

13 11 
C, indirectly detected H, and observed H dimensions, respectively. 

A 13 C- separated 3D NOESY-HMQC spectrum (S. Fesik, et a/., J. Mag n. 

25 EssaiL, 87: 588-593 (1988)); D. Marion, et a/., J. Am.-Chem. Sac. Ill: 1515- 

1517 (1989)) was recorded using a mixing time of 75 ms. A total of 80 (t] , I3 C) 

x 72 (t2> *H) x 1024 (t3, *H) complex data points with 16 scans per increment 

were collected over spectral widths of 10638 Hz (70.49 ppm, wi), 6666.6 Hz 

(13.3 ppm, W2), and 8333.3 Hz (16.67 ppm, W3). The *H carrier frequencies 

13 

30 were set to the water resonance, and the C carrier frequency was placed at 40.0 
ppm. 

Stereospecific assignments of methyl groups of the valine and leucine 
residues were obtained by using a biosynthetic approach (Neri et a/., Biochem. . 
28: 7510-7516 (1989)) on the basis of the I3 C- 13 C one-bond coupling pattern 
35 observed in a high-resolution *H, 13 C-HSQC spectrum (G. Bodenhausen, et al. 9 
J- CheiDr Phys, Lett,, 69: 185-189 (1980)) of a fractionally 13 C-labeled protein 
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13 1 

sample. The spectrum was acquired with 200( C, t\ ) x 2048( H, t2) complex 

points over spectral widths of 5000 Hz (39.8 ppm, I3 C) and 8333 Hz (16.7 ppm, 
1 13 

H). Carrier positions were 20.0 ppm for the C dimension, and at the water 
frequency for the 'h dimension. 

12 

5 To detect NOEs between the two ligands and the protein, a 3D C- 

13 

filtered, C-edited NOESY spectrum was collected. The pulse scheme consisted 
13 

of a double C-filter sequence (A. Gemmeker, et al y J. Maen. Reson. . 96; 199- 
204 ( 1 992)) concatenated with a NOES Y-HMQC sequence (S. Fesik, et a/., L 
Maen. Reson. . 87: 588-593 (1988)); D. Marion, et al. y J. Am. Chem. Soc . Ill: 

10 1 5 1 5- 1 5 1 7 ( 1 989)) . The spectrum was recorded with a mixing time of 80 ms, 
and a total of 80 (tj, 13 C) x 80 (t2, *H) x 1024 (13, *H) complex points with 16 
scans per increment. Spectral widths were 8865 Hz (17.73 ppm, w]), 6667 Hz 
(13.33 ppm, w2), and 8333 Hz (16.67 ppm, W3), and the carrier positions were 
40.0 ppm for the carbon dimension and at the water frequency for both proton 

15 dimensions. 

To identify amide groups that exchanged slowly with the solvent, a series 
of ] H, 15 N-HSQC spectra (G. Bodenhausen, et al. y J. Chem. Phvs. Lett. . 69: 
185-189 (1980)) were recorded at 25*C at 2 hr intervals after the protein was 
exchanged into D2O. The acquisition of the first HSQC spectrum was started 2 

20 hrs. after the addition of D2O. 

All NMR spectra were recorded at 25°C on a Bruker AMX500 or AMX600 
NMR spectrometer. The NMR data were processed and analyzed on Silicon 
Graphics computers. In all NMR experiments, pulsed field gradients were applied 
where appropriate as described (A. Bax, et al. y J. Magn. Reson. . 99: 638 (1992)) 

25 to afford the suppression of the solvent signal and spectral artifacts. Quadrature 
detection in indirectly detected dimensions was accomplished by using the States- 
TPPI method (D. Marion, era/.. J. Am, Chem. Soc , 111: 1515-1517 (1989)). 
Linear prediction was employed as described (E. Olejniczak, et ai, J. Magn. 
Reson>. 87: 628-632 (1990)). 

30 The derived three-dimensional structure of the ternary complex was then 

used to define the spatial orientation of the first and second ligands to each other as 
well as to the target stromelysin molecule. 

Distance restraints derived from the NOE data were classified into six 
categories based on the NOE cross peak intensity and given a lower bound of 1.8 

35 A and upper bounds of 2.5 A, 3.0 A, 3.5 A, 4.0 A, 4.5 A, and 5.0 A, 

respectively. Restraints for <{> torsional angles were derived from JHNHa 
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coupling constants measured from the 3D HNH A-J spectrum (G. Vuister, et a/., J. 
Am. Chem. Soc . 115: 7772-7777 (1993)). The <|> angle was restrained to 
120%±40% for 3 JHNHa > 8.5 Hz, and 60%±40% for 3 JHNHa < 5 Hz. 

Hydrogen bonds, identified for slowly exchanging amides based on initial 
5 structures, were defined by two restraints: 1.8-2.5 A for the H-0 distance and 1.8- 
3.3 A for the N-O distance. Structures were calculated with the X-PLOR 3. 1 
program (A. Briinger, "XPLOR 3.1 Manual," Yale University Press, New Haven, 
1992) on Silicon Graphics computers using a hybrid distance geometry-simulated 
annealing approach (M. Nilges, et a/., FEBS Lett. . 229: 317-324 (1988)). 
io A total of 1032 approximate interproton distance restraints were derived 

^from the NOE data. In addition, 21 unambiguous intermolecular distance 
restraints were derived from a 3D 12C-filtered, 13C-edited NOESY spectrum. Of 
the 1032 NOE restraints involving the protein, 341 were intra-residue, 410 were 
sequential or short-range between residues separated in the primary sequence by 
1 5 less than five amino acids, and 28 1 were long-range involving residues separated 
by at least five residues. 

In addition to the NOE distance restraints, 14 § dihedral angle restraints 

were included in the structure calculations that were derived from three-bond 
3 

coupling constants ( JHNHa) determined from an HNHA-J spectrum (G. 

20 Viioster, et al, J. Am. Chem. Soc . 115: 7772-7777 (1 993)). The experimental 
restraints also included 120 distance restraints corresponding to 60 hydrogen 
bonds. The amides involved in hydrogen bonds were identified based on their 
characteristically slow exchange rate, and the hydrogen bond partners from initial 
NMR structures calculated without the hydrogen bond restraints. The total number 

25 of non-redundant, experimentally-derived restraints was 1 166. 

The structures were in excellent agreement with the NMR experimental 
restraints. There were no distance violations greater than 0.4 A, and no dihedral 
angle violations greater than 5 degrees. In addition, the simulated energy for the 
van der Waals repulsion term was small, indicating that the structures were devoid 

30 of bad inter-atomic contacts. 

The NMR structures also exhibited good covalent bond geometry, as 
indicated by small bond-length and bond-angle deviations from the corresponding 
idealized parameters. The average atomic root mean square deviation of the 8 
structures for residues 93-247 from the mean coordinates was 0.93 A for 

35 backbone atoms (C a , N, and C), and 1.43 A for all non-hydrogen atoms. 
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10 



15 



A ribbon plot of the ternary complex involving stromelysin, 
acetohydroxamic acid (the first ligand), and the second ligand is shown in Fig 10. 
The structure is very similar to the global fold of other matrix metal loproteinases 
and consists of a five-stranded B-sheet and three a-helices. 

The catalytic zinc was located in the binding pocket It was coordinated to 
three histidines and the two oxygen atom of acetohydroxamic acid. A biaryl group 
of the second ligand was located in the S V pocket between the second helix and 
the loop formed from residues 218-223. This deep and narrow pocket is lined 
with hydrophobic residues which make favorable contacts with the ligand. 

Based on the three-dimensional structure of the ternary complex as 
determined above and the structure/activity relationships observed for the binding 
to stromelysin of structural analogs of the second ligand (i.e., other biaryl 
compounds), new molecules were designed that linked together the 
acetohydroxamic acid to biaryls. 

As shown in Table 4 below, the initial biaryls chosen contained an oxygen 
linker and the absence or presence of CN para to the biaryl linkage. Initial linkers 
contained varying lengths of methylene units. Means for linking compounds with 
linkers having varying lengths of methylene units are well known in the art 



Table 4 



H 




R 
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Compound 


X 


R 


Stromelysin 
Inhibition 


2J 


(CH 2 ) 2 


H 


0.31 M.M 






o 
H 


i in nU 


23 


(CH 2 ) 4 


H 


38%@100^M 


24 


(CH 2 )5 


H 


43%@100iiM 


25 


(CH 2 )2 


CN 


0.025 uM 


26 


(CH 2 h 


CN 


3.4 uM 


27 


(CH 2 ) 4 


CN 


3.5 \iM 


28 


(CH 2 ) 5 


CN 


1.7 uM 



As expected based on the better binding of the CN substituted biaryls to 
stromelysin, the CN derivatives exhibited better stromelysin inhibition. The 
5 compound that exhibited the best inhibition of stromelysin contained a linker with 
two methylene units. 

The present invention has been described with reference to preferred 
embodiments. Those embodiments are not limiting of the claims and specification 
in any way. One of ordinary skill in the art can readily envision changes, 
10 modifications and alterations to those embodiments that do not depart from the 
scope and spirit of the present invention. 
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SEQUENCE LISTING 



5 (1) GENERAL INFORMATION: 

fi) APPLICANT: Fesik, Stephen W. 

Hajduk, Philip J. 

10 (ii) TITLE OF INVENTION: Use of Nuclear Magnetic Resonance to 

Identify Ligands to Target Biomolecules 

(iii) NUMBER OF SEQUENCES: 6 

15 (iv) CORRESPONDENCE ADDRESS : 

{A) ADDRESSEE: Steven F. Weinstock, Dept. 377 AP6D, 
Abbott Laboratories 

(B) STREET: 100 Abbott Park Road 

(C) CITY: Abbott Park 
20 (D) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP: 60064-3500 

(v) COMPUTER READABLE FORM: 
25 (A) MEDIUM TYPE : Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

35 (viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Janssen, Jerry F. 

(B) REGISTRATION NUMBER: 29,175 

<ix) TELECOMMUNICATION INFORMATION: " 
40 (A) TELEPHONE: (708) 937-4558 

(B) TELEFAX: (708) 938-7742 

(2) INFORMATION FOR SEQ ID NO:l: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



50 



55 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO;l: 



Phe Arg Thr Phe Pro Gly lie Pro Lys Trp Arg Lys Thr His Leu Thr 
15 10 15 

Tyr Arg lie Val Asn Tyr Thr Pro Asp Leu Pro Lys Asp Ala Val Asp 



SUBSTITUTE SHEET (RULE 26) 



WO 97/18471 PCT/US96/18270 

37 

20 25 30 

Ser Ala Val Glu Lys Ala Leu Lys Val Trp Glu Glu Val Thr Pro Leu 
35 40 45 

5 

Thr Phe Ser Arg Leu Tyr Glu Gly Glu Ala Asp lie Met lie Ser Phe 
50 55 60 

Ala Val Arg Glu His Gly Asp Phe Tyr Pro Phe Asp Gly Pro Gly Asn 
10 65 70 75 80 

Val Leu Ala His Ala Tyr Ala Pro Gly Pro Gly lie Asn Gly Asp Ala 
85 90 95 

15 His Phe Asp Asp Asp Glu Gin Trp Thr Lys Asp Thr Thr Gly Thr Asn 
100 105 110 



20 



50 



£jeu Phe Leu Val Ala Ala His Glu lie Gly His Ser Leu Gly Leu Phe 
115 120 125 

His Ser Ala Asn Thr Glu Ala Leu Met Tyr Pro Leu Tyr His Ser Leu 
130 135 140 



Thr Asp Leu Thr Arg Phe Arg Leu Ser Gin Asp Asp lie Asn Gly lie 
25 145 150 155 160 

Gin Ser Leu Tyr Gly Pro Pro Pro Asp Ser Pro Glu Thr Pro 
165 170 

30 (2) INFORMATION FOR SEQ ID NO; 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

35 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Thr Thr Pro lie He His Leu Lys Gly Asp Ala Asn He Leu 
15 10 15 

45 Leu Cys Leu Arg Tyr Arg Leu Ser Lys Tyr Lys Gin Leu Tyr Glu Gin 
20 25 30 

Ser Val Ser Thr Trp His Trp Thr Cys Thr Asp Gly Lys His Lys Asn 
35 40 45 



Ala He Val Thr Leu Thr Tyr He Ser Thr Ser Gin Arg Asp Asp Phe 
50 55 60 



Leu Asn Thr Val Lys lie Pro Asn Thr Val Ser Val Ser Thr Gly Tyr 
55 65 70 75 80 

Met Thr He 

(2) INFORMATION FOR SEQ ID NO: 3: 
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10 



15 



25 



30 



40 



45 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 

GAAATGAAGA GTCTTCAA 
18 

(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 18 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

GCGTCCCAGG TTCTGGAG 
18 

(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

ATACCATGGC CTATCCATTG GATGGAGC 
28 

(2) INFORMATION FOR SEQ ID NO;6: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 0 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 

ATAGGATCCT TAGGTCTCAG GGGAGTCAGG 
30 
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WHAT IS CLAIMED IS: 



] . A process of screening compounds to identify compounds that are ligands 
that bind to a specific target molecule comprising the steps of: 

a) generating a first two-dimensional *^N/*H NMR correlation 
spectrum of a ^N-labeled target molecule; 

b) exposing the labeled target molecule to one or a mixture of chemical 
compounds; 

c) generating a second two-dimensional *^N/*H NMR correlation 
spectrum of the labeled target molecule that has been exposed to 

one 

or a mixture of compounds in step (b); and 

d) comparing said first and second two-dimensional 1 ^N/* H NMR 
correlation spectra to determine differences between said first and 
said second spectra, the differences identifying the presence of one 
or more compounds that are ligands which have bound to the target 
molecule. 



2 . The process of claim 1 wherein the 15 N-labeled target molecule is 
exposed to a mixture of chemical compounds in step (b), further 
comprising the steps subsequent to step d) of 

e) exposing the 15 N-labeled target molecule individually to 
each compound of said mixture, 

f) generating a two-dimensional *^N/*H NMR correlation 
spectrum of the labeled target molecule that has been 
individually exposed to each compound; and 

g) comparing each spectrum generated in step 0 to said first 
spectrum to determine differences in any of those compared 
spectra, the differences identifying the presence of a 
compound that is a ligand which has bound to the target 
molecule. 

3 . The process of claim 1 wherein the differences in the two-dimensional 
*^N/*H NMR correlation spectra are chemical shifts at particular ^N- 
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labeled sites in the target molecule and chemical shifts in protons attached 
to those ^N-labeled sites. 



4. 



The process of claim 1 wherein the target molecule is a polypeptide. 



5 . A process of determining the dissociation constant between a target 

molecule and a ligand that binds to that target molecule comprising the 
steps of: 

a) generating a first two-dimensional * ^N/ 1 H NMR correlation 
5 * spectrum of a ^N-labeled target molecule; 

b) exposing the labeled target molecule to various concentrations of a 
ligand; 

c) generating a two-dimensional * ^N/ 1 H NMR correlation spectrum 
at each concentration of ligand from step (b); 

10 d) comparing each spectrum from step (c) both to the first spectrum 

from step (a) and to all other spectra from step (c) to quantify 
differences in those spectra as a function of changes in ligand 
concentration; and 
e) calculating the dissociation constant between the target molecule 

15 and the ligand from those differences according to the equation: 

K D == ([P] 0 -x)([L] D -x) 
x 

where [P]o is the total molar concentration of target 
molecule; 

20 [L]o is the total molar concentration of ligand; and 

x is the molar concentration of the bound species 

determined 

according to the equation: 
x = 

25 A 

where bobs and fifoe are the chemical shift values for the 

target molecule determined at each concentration of 
ligand and for the target molecule in the absence of 
ligand, respectively, and A is the difference between 
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the chemical shift at saturating amounts of ligand 
and Sfree- 



6 . The process of claim 5 wherein the target molecule is a polypeptide. 

7 . The process of claim 5 further comprising the step of binding the labeled 
target molecule to a second ligand before step (a). 
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